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"To complete the process of intellectual maturation, every 
student should be required to apply what he or she has 
learned to an economic problem and, in the process, acquire 
experience really 'doing economics. ’ For a particular 
intellectual encounter to accomplish this goal, it should 
involve considerable responsibility on the student's part for 
formulating questions, gathering information, structuring and 
analyzing information, and drawing and communicating 
conclusions to others in an oral and/or written form ." 


-SIEGFRIED ETAL. (1991) 


In the last fifteen years many leading experts in economic pedagogy 
have argued that rather than merely passing courses, economics students 
should be required to demonstrate proficiency in doing economics. 
Although doing economics (in the form of economic research) is a key 
element in graduate education, it has not been the norm at the under¬ 
graduate level. Over the last decade this has changed as an increasing 
number of institutions have added an undergraduate research emphasis 
to their economics programs. This book aims to guide these novice 
researchers, whether they are taking an undergraduate course in research 
methodology, completing a senior capstone course, pursuing a senior the¬ 
sis, or attempting a research project early in graduate studies. 

The objective of Doing Economics is to make explicit to novice 
researchers what experienced researchers know implicitly about the 
research process. The book’s principal thesis is that research is funda¬ 
mentally a process of constructing persuasive arguments supported by the¬ 
ory and empirical evidence. The more technical issues of literature 


x 


xi 



Preface 


searching, theory formulation, data collection, and empirical testing are 
the details by which the process occurs, but tire big picture is about build¬ 
ing a case around one’s interpretation of an interesting issue or problem. 

Research is often viewed as one of those competencies students must 
have learned in an earlier course or can pick up on their own. But experts 
in the field of research and writing expose this view as a fallacy. 

Although many research skills are taught in traditional courses, the 
broader skills required for research design tend not to be. Traditional 
courses succeed in teaching die technical skills—for example, drose 
involving analytic reasoning. When asked to apply a specific theory to an 
issue or problem, or to perform a specific statistical test on a given data set, 
most novice researchers do quite well. However, traditional courses are 
less successful in teaching the broader, creative skills required for 
research. These broader, creative skills include: 

® The ability to frame a good research question when given an issue or 
problem 

• The ability, to identify and apply an appropriate model to a given 
research problem 

• The ability to develop an appropriate data set 

0 The ability to select appropriate ways to test a specific hypothesis 

° The ability to use the results to create a scholarly argument 

More generally, novice researchers fail to grasp that research is not a 
straightforward, mechanical process, but one that is replete with ambigu¬ 
ities: there is more than one correct way to approach a given research 
problem, as well as many incorrect ways. While research can never be 
reduced to a simple cookbook process, it can be taught in a more explicit 
manner, and this text seeks to make the steps involved in doing research 
clear to the novice researcher. 
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What Is Research? 


“I hear and I forget; I see and I 
remember; I do and I understand ." 

CONFUCIUS 



he best way to learn economics is not to hear about it, or to read 
|| about it, but to do it. Doing economics means performing eco- 
nomic research. As Booth et al. (1995, 2) note, “Doing research 
El!-'.. . can help you understand the materia] you are studying in a way 
that no other kind of work can match.” 

Faculty often view research ability, like writing, as a skill that students ei¬ 
ther must have learned earlier in their education or can pick up on their 
own. After all, haven’t all undergraduates written term papers? In fact, ex¬ 
perts in the field of research and writing have exposed this as a fallacy. 
Bean (1996), a nationally known scholar of critical thinking, points out that 
most undergraduates do not really understand how to write a discipline- 
specific research paper. 

This book is intended to remedy this problem. It is designed to guide 
students through the research process from the conception of the re¬ 
search question to the completed research report. Students often find that 
research is a difficult process. As they struggle to complete the project they 
conclude that something is wrong with them. What they fail to grasp is that 
research is difficult. It is not a straightforward, mechanical process but one 
replete with ambiguities, wrong turns, and “wasted” efforts. Progress in the 
research process often comes in fits and starts rather than evenly. It isn’t 
always or even usually predictable. In contrast to most of the assignments 
students encounter in their education, with research there may not be a 
well-defined answer. Every researcher, including experts, feels confused or 
even overwhelmed at some point during a research project. The differ¬ 
ence is that experienced researchers, like marathon runners hitting “the 




wall,” know that this is normal and that with perseverance one will get 
through it. 1 In short, this book is designed to make explicit to undergrad¬ 
uate economics students what experienced researchers know implicidy 
about the research process. 

The book is an introduction to research methodology in economics. 
Methodology is more than a fancy word for methods. Research methods 
are the tools of economic research: for example, online search skills, crit¬ 
ical reading and wilting, and statistical methods. Research methodology is 
how those tools are combined to produce valid and reliable research. 

Research Is the Creation of Knowledge 

One popularly held view of research is that it means “the search for 
knowledge.” This definition seems to view knowledge as like fruit on a tree 
in a forest. Thus, all that is necessary in the research process is for the re¬ 
searcher to discover the tree and collect the knowledge, much like a 
farmer picks fruit. 

This idea of knowledge is based on the traditional view of science as en¬ 
tirely objective, where the data, when collected, “speak for themselves.” In 
other words, the data will yield the same conclusions to any researcher. 

This view fails to differentiate between knowledge and facts. Knowledge 
is what is believed to be true about something, what is believed to be a cor¬ 
rect understanding of something. Facts are just data. Knowledge, by con¬ 
trast, is facts with meaning; that is, it is an expert’s best interpretation of 
tire facts. You can see this distinction if you compare the results from a 
chemistry lab (i.e., the data) with the researcher’s discussion of the results 
in a lab report (i.e., the interpretation). Only the latter is the knowledge. 

Research, then, is not merely searching for facts. Research is more com¬ 
pletely defined as the creation of (valid) knowledge. 2 Facts are discovered ; 
knowledge, as an interpretation, is created , 3 Instead of a farmer harvesting 
a fruit tree, a better analogy for research is a detective searching for clues 
and then developing a case on the basis of those clues or other evidence. 
Scholars create knowledge by constructing arguments. In a research con¬ 
text, an argument is not primarily a quarrel or controversy. Rather, an ar¬ 
gument is a point of view or position on a question. More formally, an 
argument is an assertion or claim supported by reasons or evidence. 4 
Knowledge in any discipline can be thought of as a conversation or dialog 
between scholars as they develop competing arguments. One example fa¬ 
miliar to students of macroeconomics is the conflicting views between the 
Monetarists and the Keynesians. Through this dialog, arguments are eval¬ 
uated, the weaker ones are winnowed out, and the stronger ones are re¬ 
fined and improved. 5 Thus, over time knowledge in the field advances. 


How Are Arguments Evaluated? 


Students and even some faculty believe that it is simply unrealistic to ex¬ 
pect undergraduates to be able to perform original research. Cohen and 
Spencer (1993, 222) cite student comments such as “How can I tell you 
anything you don’t already know?” and “How can you expect an under¬ 
graduate to say anything original?” 

Part of the reasoning behind such comments stems from the percep¬ 
tion that research includes only groundbreaking, paradigm-shifting ex¬ 
amples like the theory of evolution or the theory of relativity. There are 
two problems with this perception. First, the majority of research repre¬ 
sents only marginal improvements in our understanding. This is not to say 
that such improvements are not important, merely that they represent rel¬ 
atively modest advances in our knowledge. Second, even major “break¬ 
throughs” are based on the work that came before. For example, Ethridge 
(1995) points out that the “discoveries” we associate with such notables as 
Alfred Marshall and John Maynard Keynes can be traced back to earlier 
work by lesser-known scholars. 

In fact, undergraduates can complete serious, legitimate research proj¬ 
ects. This is attested to by the growing number of undergraduate eco¬ 
nomics programs that require senior research projects and the increasing 
number of journals that publish undergraduate research. 6 Booth et al. 
(1995, 7) assert that “it is no exaggeration to say, that done well your [re¬ 
search] will change the world tomorrow.” 

Ethridge (1995) distinguishes between two types of research: discovery, 
that is, “formulating, finding, and creating new knowledge or informa¬ 
tion” and confirmation, that is, “discerning the validity or reliability of 
knowledge or information.” Undergraduate research may be largely con¬ 
firming, such as running new tests of previously established theories. Such 
research is still considered “new” in that it adds to our knowledge by, for 
example, applying the previous theory to new data or new situations, 
which if successful, broadens its applicability. 

How Are Arguments Evaluated? 

Earlier, we defined research as the creation of valid knowledge, and we 
described knowledge as a dialog between competing arguments. Argu¬ 
ments “compete” through their validity. What makes knowledge valid? 
Each discipline has its own approach and language of discourse, but they 
all boil down to the use of logic and evidence to support a conclusion. 7 

Let’s explore this idea in more detail, to find out how arguments are 
evaluated and either (provisionally) accepted or refuted. 

First-year college students tend to think “dualistically.” 8 That is, they per¬ 
ceive (nearly) every question to have a unique correct answer, like a problem 



in mathematics, for example, 2 + 2 = 4. Students think in terms of black and 
white, despite the fact drat we live in a gray world. This perception is not their 
fault; they’ve been trained for at least twelve years to t hink that way. 

By the time students are sophomores and juniors, their thinking has ad¬ 
vanced to “multiplicity.” That is, questions have more than one correct an¬ 
swer. For example, in macroeconomics students learn both that “The 
Great Depression was caused by inappropriate monetary policy” and that 
“The Great Depression was caused by instability in the private sector.” Most 
students conclude that since multiple points of view exist, knowledge and 
truth are essentially subjective! Though multiplicity is a more sophisti¬ 
cated form of thinking than dualistic thinking, it’s not valid to conclude 
that knowledge and truth are subjective. In truth, the fact that there are 
diffei ent points of view about an issue or question does not imply that all 
points of view are correct or equal. (I have often said to my students, 
“There is more than one correct answer to this question, but there are also 
an infinite number of incorrect answers.”) Suppose you are a juror in a 
criminal trial and both the prosecution and defense bring in experts who 
contradict each other. There are two points of view, but only one is true: 
the defendant is either guilty or innocent. How do you decide? 

To make a valid decisioir, you will need to think critically about the tes¬ 
timony. 9 Critical thinking is one of drose concepts (like “liberal educa¬ 
tion”) that college students are familiar with but often find difficult to put 
into words. Missimer (1995) defines critical thinking as the evaluation of 
competing arguments on the basis of tireir evidence. Thus, it involves the 
ability to recognize and assess an argument and its constituent parts, in¬ 
cluding the assumptions, logic, and evidence. This will be explained in de¬ 
tail in Chapter 4. For now, we can say that when scholars evaluate different 
arguments, they ask questions such as: 

What are the reasons behind the argument? 10 

Does the argument make sense? Why or why not? 

Is the logic flawed? 

What are the underlying assumptions (explicit and implied)? Are 
they flawed? 

How critical are the assumptions? That is, would different assump¬ 
tions lead to different conclusions? 

What is the empirical evidence? Does it support the conclusion? 

In light of the reasons and evidence provided, is the argument per¬ 
suasive? If so, the conclusion is valid. 11 

Let s think about how this winnowing process works. Arguments are 
evaluated to see if they stand up to intellectual scrutiny according to die 
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Summary 


criteria just given. If flaws are discovered, the arguments are disproved, af¬ 
ter which they may be revised or discarded. For the moment, however, they 
are removed from consideration. If flaws are not discovered, the argu¬ 
ments are only provisionally accepted, never proved. 

That arguments can be disproved but never proved is sometimes diffi cult 
for people to grasp. Consider a sports metaphor. Who is the best tennis 
player in the world? Players compete and as long as they win, they remain in 
the running for the title. When they lose, they are (at least temporarily) out 
of contention. But no player can ever be the best forever, only provisionally 
so, until a better player comes along. It is the same way with arguments. 

Think of an argument as a rope across the Grand Canyon. Each test of 
an argument is like a thread. No one test is strong enough to bear your 
weight. Over time, however, when multiple tests Confirm the argument, the 
thread becomes a string, the string becomes a rope, and the rope becomes 
a thick cable. At some point, you become willing to trust it. By contrast, 
imagine that when a test of the argument fails, the thread is cut. If there are 
enough failures, the rope is weakened, and ultimately severed in two. 

In short, we evaluate the supporting evidence so as to weed out flawed 
arguments. If we are left with a single point of view, we can conclude that 
it is correct. Often, however, there is more than one argument left. 12 How 
do we choose then? When scholars face more than one argument they can 
choose between them on the basis of disciplinary norms, such as efficiency 
or equity. 13 For example, consider the following two positions: 

Free international trade is better for a nation than protectionism because 
under free trade a nation will have more goods and services, that is, a 
higher GDP. 

Tariffs and quotas are necessary to protect workers and businesses be¬ 
cause certain sectors of the economy cannot compete internationally un¬ 
der free trade. 

Both are arguable: the first on the basis of efficiency and the second on 
the basis of equity. 

SUMMARY 

® Research is the search for or creation of valid knowledge. 

* Rather than a collection of facts, knowledge is interpretation of the facts. 

* The community of learners in any discipline represents a dialog between 
competing arguments: assertions made about the correct interpretation 
of the facts. 
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* H °w do arguments compete? What makes for valid knowledge? 

0 Valid research is research that is demonstrable and persuasive to other 
scholars in the field. 

* Research is persuasive when it is done systematically, by design, and 
where the conclusions are supported by reason and empirical evidence. 

MOTES 

1. Booth et al. (1995, 23-25) make a similar point. 

2. Ethridge (1995) defines research as “the systematic approach to ob¬ 
taining new and reliable knowledge” (p. 16). 

3. Truth is not created, but our understanding, our interpretation, is created. 

4. Missimer (1995). 

5. See Appendix 1A for a more detailed treatment of how knowledge is 
created in economics. 

6. Chapter 2 provides specific examples of outlets for undergraduate re¬ 
search in economics. 

7. Ethridge (1995) defines validity as “demonstrable to others based on 
reason and evidence.” This is a useful way to think of it. Remenyi et al. 
(1998, 24) argue similarly that a “researcher has to be able to convince 
an audience of the value and relevance of his or her research ef¬ 
forts. ... In addition, the academic researcher needs to explain why 
his or her research should be considered important and needs to be 
able to point out precisely what was found and what use the findings 
are to the community.” 

8. This view of thinking, including the labels “Dualistic” and “Multiplic¬ 
ity,” is from Nelson’s (1989) interpretation of Perry (1970). 

9. Ethridge (1995) notes that critical thinking is an essential element for 
research. 

10. Missimer (1995) calls assertions that are unsupported by reasons 
“loose arguments.” 

11. Nelson (1989) characterizes this level of thinking as “Contextual 
Relativism.” 

12. This is analogous to the concept of Pareto optimality, as demonstrated, 
for example, by a production possibilities frontier, where all of tire 
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Suggestions for Further Reading 

“points” are Pareto efficient but to choose between them requires 
some additional criteria for judgment. 

13. Nelson (1989) labels this level of thinking as “Contextually Appropri¬ 
ate Decisions.” 

SUGGESTIONS FOR FURTHER READING 

Bean (1996), especially Chapter 12 —Useful guide to teaching research, 
writing, and critical thinking to undergraduates. Chapter 12 is a re¬ 
thinking of how to use research papers as an effective tool for un¬ 
dergraduate education. Bean emphasizes the view of research as 
developing an argument. 

Booth et al. (1995) —Classic text on college-level research. High-level re¬ 
search text: not concerned with mechanics, but rather research design 
and argument construction. Focus is on research in the humanities but 
is very applicable to research in the social sciences as well. 

Blaug (1992) —Well-kn own book-length survey of the range of methodologies 
used by economists. For a similar but shorter survey, see Hausman, 1989. 

Ethridge (1995) —Readable guide to research methodology in economics at 
the graduate level. Ethridge teaches agricultural economics; as such, 
the book is very applied and thus comprehensible to undergraduate re¬ 
searchers. See especially Chapters 1-4 on methodological foundations 
of economic research. 

Friedman (1968) —Thoughtful article on research methodology as prac¬ 
ticed by economists. For a similar article, with a slightly different point 
of view, see Machlup (1965). 

Hausman (1989) —Survey article on economic methodology. For a more 
detailed survey, see Blaug (1992). 

Machlup (1965) —Another view on research methodology as practiced by 
economists. Compare with Friedman (1968). 

Nelson (1989)-— Widely used summary of Perry’s taxonomy of critical thinking. 

Perry (1970) —Classic study of the levels of critical thinking obtained by 
U.S. undergraduates. 

Bemenyi et al. (1998) —Guide to research methodology in business admin¬ 
istration. Designed for master’s- or doctoral-level research, but as an ap¬ 
plied text, it has much to offer undergraduate researchers. 



chapter 1 • What Is Research? 


EXERCISES 

1. Economists have a reputation for being unable to reach a consensus on 
issues. You may have heard the joke that if you laid all the economists 
in the world end to end, they would fail to reach a conclusion. This rep¬ 
utation is due, in part, to the way economic knowledge is created. The 
reference list for this chapter includes articles by Fritz Machlup and 
Milton Friedman, two respected economists who address this issue. 
Read the articles by Machlup (1965) and Friedman (1968). According 
to each author, what are the mtyor reasons for disagreement among 
economists? What can you infer from this list of reasons about how eco¬ 
nomic knowledge (i.e., agreement) is created? 
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The Range of Economic 
Methodologies 

We observed that scholars in any discipline present competing argu¬ 
ments to resolve issues. This is also true of those who study economic 
methodology. The view presented in this chapter represents a consensus 
of the majority of economists, but the dialog of economic methodologists 
includes a range of views. 

Hausman (1989) argues that real-world complexities in the social sci¬ 
ences, such as researchers’ inability to conduct controlled experiments, 
preclude any chance of convincing empirical evidence. Thus, scholars 
should use inductive reasoning to establish the basic psychological or tech¬ 
nological laws of economics, such as utility maximization or diminishing 
marginal productivity. 

Then, for a specific application of such a law, one can deduce the eco¬ 
nomic implications. For example, assuming consumers maximize utility, 
one can deduce that higher prices result in fewer items purchased. 

At the other end of tire methodological spectrum is Blaug (1992), who 
argues for falsification. That is, not only should hypotheses be judged on 
the basis of empirical testing, but scientists should actively try to refute tire 
hypotheses of their research. They should reject those that are refuted, in¬ 
stead of refining and retesting ad nauseam. 

The view presented in this chapter is closer to Blaug than to Hausman. 

One view, which is somewhat outside this spectrum, is McCloskey 
(1998). She argues that economists don’t rely merely or even primarily on 
scientific forms of proof such as logic and empirical evidence, but on lit¬ 
erary ones as well, such as introspection, case studies, and metaphor. That 
economists use these devices is clear; whether or not they can persuade 
other experts of their views without using scientific forms of proof is not. 





Process in Economics 


'If we knew what we were doing, it 
wouldn't be called research, would it?" 



ALBERT EINSTEIN 


” P revious chapter, we defined research in fairly generic terms. 
While we used examples from economics, the story could have been 
^sipphed to almost aiiy discipline, from anthropology to zoology. Now 
... Net s focus more closely on the research process in economics. This 
chapter will provide an overview of that process, with an emphasis on the 
first step: developing an effective research question. Subsequent chapters 
will examine the remaining steps of the process in detail. At the end of the 
introduction to each step in this chapter, you will be referred to the ap¬ 
propriate chapter later in the book. 1 

At this stage, most of the material introduced in this chapter will seem 
ike Greek to you. Though you may have already been exposed to some of 
what we discuss here, until you complete your own economics research 
project you are unlikely to really grasp it. That’s okay! Everyone in your 
class is in your shoes. And every professional researcher has gone through 
this process. You can be sure that by the end of your project, everything 

will make a great deal more sense. 7 g 


Research in Science and INJonscience Disciplines 

Academic knowledge can be divided into two classes: science disciplines 
and nonscience disciplines. The difference between science and other dis- 
ciphnes lies both in the types of questions they explore and the kinds of 
proof they accept as convincing. Scientific questions are fundamentally 
questions of objective fact: what causes earthquakes or how do catalysts 
work. Nonscientific questions are questions about the affective realm: 
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what is the meaning of life or how does a sunset make you feel? Science is 
research that is empirically testable, at least in principle, while research in 
the nonscientific disciplines is not empirically testable. This is not to say 
that there are no standards of proof in nonscientific disciplines, merely 
that the proof is normative and not based on empirical facts. 

Scholars in tire sciences perform research using the scientific method. 
The scientific method is a set of procedures for drawing valid, reliable, and 
objective conclusions. These procedures include the following steps: 

• Select a scientific problem or question; 

° Apply a theory to derive a hypothesis about the problem or question; 

• Test the hypothesis by comparing its predictions to evidence from the 
real world; 

• If the hypothesis fails the test, modify it (and retest) or reject it; 

• If the hypothesis passes the text, provisionally accept it; 

° Test the hypothesis in a new context. 

The scientific method is not an arbitrary set of procedures. Rather, for 
research to be valid it needs to be done systematically. Spector (1981, 7) 
observes, “It is not necessary to use scientific methods to answer [research] 
questions. One might rely on intuition or educated opinion.” 1 Isn’t this 
how most people answer questions, especially those for which they lack 
specialized knowledge? The problem with such nonscientific methods is 
that they are subject to one’s biases and other subjective factors. Psycholo¬ 
gists have identified the “fallacy of personal validation,” in which people 
tend to “see” what they hope or expect to see in evidence. The scientific 
method, while not infallible, is designed to avoid or at least minimize these 
biases, and result in objective findings. 

Steps of the Research Process for Economics 

Researchers in economics, as a social science, use a version of the sci¬ 
entific method. This research process in economics can be broken down 
into six steps, shown in Table 2.1. 

As we discuss these six steps, you will see that a research project is not a 
linear process; rather, the steps are overlapping and iterative. Instead of 
completing the steps one by one until the end, often the researcher finds 
it necessary to return to an earlier step in the process as he or she revises 
his or her thinking. 2 For example, though you need to begin thinking 
about the research question early in the process (Step 1), after you have 
reviewed what other scholars have done in tire area (Step 2) you may need 
to rerise the question. Similarly, in determining how to perform the em¬ 
pirical testing (Step 4), you may need to consult previous testing method- 
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NOTES FOR NOVICE RESEARCHERS 

Requirements for a Good Research Project 

There are a number of things necessary for completing a good piece 
of research. These include: 

• A Good Research Question (Chapter 2) 

• A Testable Hypothesis (Chapter 7) 

• A Good Data Set (Chapters 8-9) 

• An Empirical Methodology that Adequately Tests the Hypothesis 
(Chapters 10-11) 

(Each of these is discussed in detail in the chapters indicated in 
parentheses.) 

If your project lacks any of these four items, the best you can hope 
for is a mediocre outcome in your research. It makes sense then, at 
least early on in your project, to think about these items in parallel, 
rather than sequentially, since any one of them can trip you up. For 
example, as soon as you identify a research question, you should begin 
looking for data in that area, even if that means reading ahead in the 
appropriate chapters of this book. 

Note also that these four things are necessary, but not sufficient, for 
a good project. In other words, they don’t guarantee a good project will 
result. In addition, you will need to correctly interpret your empirical 
results as well as persuade the reader of your conclusions. 


The Research Process in Economics 

1 . .Developing an Effective Research Question 

2. Surveying the Literature on the Topic 

3. Analyzing the Issue or Problem 

4. Testing Your Analysis 

5. Interpreting the Results and Drawing Conclusions 

6. Communicating the Findings of the Research Project 
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ologies (Step 2). Additionally, lack of adequate data for testing (Step 4) 
may require you to reconceptualize the problem (Step 3) in a way that is 
testable. Finally, it always makes sense to begin writing the research report 
(Step 6) as you progress through the research process, while the various 
steps are fresh in your mind, rather than waiting until the end. In short, it 
may be more accurate to describe research as a looping process. Indeed, 
Ethridge (1995) observes, “Research is a creative process. . . . There is no 
‘magic formula’ for accomplishing or producing good economic research. 
If it were that simple, anyone could do it.” In this respect, performing re¬ 
search cannot be reduced to following a recipe. Nonetheless, it is helpful 
to think about the process in terms of discrete steps. 

Step 1; Developing an Effective Research Question 

At the start of the process, the researcher must define the scope of the 
project. This requires answering, at least tentatively, three questions: 

1. What is the research topic? 

2. What is the research question? 

3. What is the research hypothesis? 

Let’s define these terms. The research topic is the general area the proj¬ 
ect will cover. For example, the topic might be “unemployment in the 
United States.” The research question is the specific focus of the research. 
For example, “How do different levels of education affect unemployment 
rates?” The research hypothesis is the researcher’s proposed answer to tire 
research question or the researcher’s principal assertion about the topic, 
which will be supported by the paper. 3 For example, “Individuals with 
higher levels of education will tend to have lower levels of unemploy¬ 
ment.” You may have to start with a tentative hypothesis until you get fur¬ 
ther into the research. That is normal. The procedure for determining a 
research hypothesis is explained below in Step 3. 

Notice that, grammatically speaking, a research topic is a subject, while a 
research question must be a complete sentence. The same is true of the re¬ 
search hypothesis, which must also be a complete sentence. This is not merely 
grammar—a subject may indicate only a vague, general idea of the research, 
while a complete sentence shows a greater depth of thinking about it. 

During the planning phase, the most important of these three concepts 
is the research question. A good one will make tire research project much 
easier and more likely to be successful. A poor one will, at best, make the 
project more difficult. A good research question has several characteris¬ 
tics, listed in Table 2.2. Let’s elaborate on each of these. 



§j Characteristics of a Good Economic Research Question 


• Problem-oriented 

® Analytical (rather than descriptive) 

• Interesting and significant 

® Amenable to economic analysis ” 

• Feasible, given the time and resources available 


A research question is almost always better when it is framed in terms of 
a problem or concern to be addressed. This tends to focus the question in 
a useful way, which is less likely to happen otherwise. 4 Closely related to the 
P™,™ orientation is a second characteristic: a good research question 
should be analytical rather than factual. That is, its purpose should be to 
explain some aspect of the problem rather than describe it. For example 
instead °f asking a question like “What is poverty?” one should ask some¬ 
thing like, Why is the poverty rate among children so much higher than 
the average poverty rate among Americans?” Bean (1996) calls descriptive 
papers all about” papers. In other words, they discuss “all about” a topic 
without necessarily having a clear point or argument. Booth et al. (1995) 
observe that factual questions tend to ask who, what, when, or where while 
analytical questions tend to ask why or how? 

Readers tend to react to descriptive papers by asking “So what?” In 
o ier words, they don’t find the papers interesting or significant. Let’s 
turn to these two concepts next. 

A research question should be interesting both to the researcher and to 
the audience. It should be interesting to the researcher because if it is not 

t ? r rum q r! nd 11 d ! fficult t0 become engaged by the problem. 6 Booth 
et al. (1995, 36) state, “Nothing will contribute to the quality of your work 

more than your sense of its worth and your commitment to it.” Your in¬ 
terest will help you stay motivated until the project’s completion, especially 
if it is a long project. r 7 

More importantly, a good research question should be interesting to 
one s audience. To grasp what this means in the context of research, you 

nm- tC I f ! InderStand 1116 Conce P t of a resear ch community. Booth et al. 
(1995 17 — 18 ) provide a very helpful definition: A research community is 
the collection of scholars who perform research on a topic. These are the 
experts m the field who teach courses as well as publish books and papers 



cnapier 4 v uvervitJW Ul uic ncacaivu riuvcao 111 uwuuimov. 


NOTES FOR NOVICE RESEARCHERS 


Cast Undergraduates Perform Real Research? 

Your instructor knows that you are not a professional researcher and 
so he or she won’t hold you to that high a standard. At the same time, 
undergraduates can and do perform real, original research. In short, 
they can contribute to the conversation of scholars on a topic, so you 
shouldn’t sell yourself short or set too low a bar for yourself. 


on the subject. 6 These scholars may be at your university or at other institu¬ 
tions anywhere around the world. They know one another by virtue of hear¬ 
ing each other’s research presentations and by reading each other’s 
published work. A research question should be of interest to this audience 
because if it is not, the research project is not worth doing. Remember, the 
point of research is to contribute to the conversation of scholars on an issue. 

At this point, student researchers face a real problem. They actually 
have two audiences: the theoretical audience just described and a practi¬ 
cal one consisting of their classmates, their instructor, and anyone who 
might read their paper if it is submitted for publication. We will make 
some suggestions about how to deal with this problem later, but for now, 
suffice it to say that a good research question is one that interests your 
readers. 

When we talk about scholars being “interested” we mean more than 
that they think the question is entertaining. In a research context, interest 
is closely related to the concept of significance. Is the question a “hot 
topic,” an area of current interest to the research community? Booth et al. 
(1995, 18-19) note, “If your question is already a live issue in your com¬ 
munity, most readers will care about it when you pose it.... [Indeed, they] 
will be especially interested if you can convince them that they don t un¬ 
derstand [it] as well as they thought they did.” However, if your topic is not 
a “live issue,” you will have to actively convince your audience that the 
question is worthy of study. 

Booth et al. (1995) define significance by asking how weighty is the prob¬ 
lem on which the research focuses. The greater the weight of the problem, 
that is, the higher the costs of leaving it unresolved, then the more signifi¬ 
cant the problem is from a research perspective. Consider three examples. 
Poverty among able-bodied individuals who choose not to work is not con¬ 
sidered to be a very significant problem. Poverty among the elderly who 
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Example of a Good Research Question: To What Extent 
Does Parental Employment Adversely Affect Children’s 
Cognitive Development? 

This is an example of a good research question. To understand why, 
consider how the criteria from Table 2.2 apply: 

8 Problem-oriented —-In the last generation, the labor force partic¬ 
ipation rate among women has increased dramatically. As a result, 
the typical U.S. family is no longer characterized by one parent 
(usually the father) working, while another parent (usually the 

, mother) stays home with primary responsibility for child rearing. 
Rather, it is increasingly the case that both parents are employed, 
and the children are placed in child-care. Given that more and 
more children are receiving child-care by someone other than a 
parent, does this adversely impact children’s cognitive develop¬ 
ment? If so, perhaps families will need to revisit the issue of em¬ 
ployment by both parents. 

8 Analytical —To address this research question, one would need 
to study the process of children’s cognitive development and, in 
particular, the role that parents or other child-care providers play. 
This requires an analytical approach. . 

8 Interesting and Significant— Past studies have indicated the 
possibility that children’s cognitive development is adversely af¬ 
fected by the lack of parental input. However, these studies have 
not developed a consensus on either the magnitude or perma¬ 
nence of this effect. Clearly, researchers have more to learn about 
this question, which affects many American families. 

® Amenable to Economic Analysis —Noneconomists may won¬ 
der about the relevance of economic analysis to children’s cogni¬ 
tive development, a topic that is more normally associated with 
the field of educational psychology. However, this topic is a 
straightforward application of labor economics and human capi¬ 
tal theory. In particular, economists studying this issue have ap¬ 
plied the theory of an educational production function, by which 
parental care and other inputs such as spending on food, shelter, 
health care, and the like are combined to “produce’’ educated 
children. 
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e Feasible, given the time and resources available—The issue of 
feasibility obviously depends on the researcher and the context 
in which the research is being done. Suffice it to say here that 
there are a number of excellent data sets that can be used in 
this research, including the National Longitudinal Study of 
Youth and the Panel Study of Income Dynamics-Child Develop¬ 
ment Survey. 


were not able to save adequately for retirement could be considered a sig¬ 
nificant problem. Poverty among children, for which they are not respon¬ 
sible and which may continue into their later lives, is probably a very 
significant problem. The ultimate test of significance, say Booth et al. 
(1995), is the extent to which a research paper asks readers to change their 
beliefs. 

What makes an economic research question? Recall from your first- 
principles course that economics was defined as a content area (i.e., in¬ 
volving questions that originate from the general condition of scarcity), 
but more importantly as a way of thinking. Is die question or problem that 
the research will examine amenable to economic analysis? In other words, 
can economic analysis give us some insights into the problem? For exam¬ 
ple, does the question deal with choice under constraints? Does it involve 
demand or supply, needs, wants, availability? If the answer to any of these 
questions is yes, then you have an economic research question. 

Finally, a researcher should be able to complete investigation of the re¬ 
search question in a reasonable period of time, for example, a semester or 
an academic year, given the resources available. A primary consideration 
here is the availability of adequate data to assess the hypothesis. Some re¬ 
searchers are fortunate enough to begin their project with a rich data set. 
For them, the challenge is determining an interesting and significant re¬ 
search question they can explore with that data. Most researchers, how¬ 
ever, start with the question and then need to search out suitable data. For 
further information on obtaining data, see Chapters 8 and 9. 

Students doing a research paper for a class often find it a bit difficult to 
select a research question—and for good reason: the assignment is artifi¬ 
cial. In the real world, a business economist is told, “Here is a problem 
research it!” Similarly, an academic economist does not say, “Gee, I have to 
do a research project, so let me think of a topic.” Rather, as an expert in 
some field(s), he or she is aware of where the research frontier is, what is 
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known, and what issues or problems in the field have yet to be satisfacto¬ 
rily explained. Those are the research questions he or she is likely to 
choose. This suggests a strategy for selecting student research questions: 

1. Pick a general topic area that interests you, ideally one in which you 
have some background (e.g., an area in which you have taken a course). 

2. Start reading the literature, not merely to see what has been done, 
but also to identify what questions remain to be answered or what 
problems remain to be solved. 

3; Select a promising research question from what you have found in 
the literature. For example, could an interesting previous study be 
applied to a new place or time? Alternatively, are there conflicting 
findings on some question? If so, you might try to reconcile the con¬ 
flict by examining the issue again, using a different set of data or a 
different methodological approach. Studies conclude by suggesting 
questions for further research. Perhaps one of these questions 
would be feasible. Finally, the literature survey may reveal gaps in 
the literature that you could explore. 

Examples of good research questions that undergraduates have investi¬ 
gated in recent years include: 

0 What factors determine the demand for blood in the Fredericksburg, 
Virginia, region? 

« Do union jobs systematically pay higher wages than nonunion jobs 
when disaggregated by occupation? 

• Does class attendance influence student performance in macro prin¬ 
ciples courses? 

• How much of a premium are consumers willing to pay for organic 
vegetables? 

• How much are consumers willing to pay for cleaner streams in the 
Enoree watershed of South Carolina? 

• Do nearby polluted streams affect housing prices? 

Step 2: Surveying the Literature 

The literature on a topic refers to the research that has been completed 
on that topic. When a researcher reviews or surveys the literature, he or 
she is asking the following questions: 

• What is currently known? 

• What has been discovered to date on a given topic? 

Thus, the objective of a literature survey is for the researcher to identify 
and become familiar with the major studies that have been published on 
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a topic Only after completing the literature survey can he 01 she finalize 
the research question. We mil discuss surveying tire literature m more de¬ 
tail in Chapter 3. 

Step 3: Analyzing the Problem . 

The objective of a research project is to explain some aspect of a sigmf- 
icant issue or problem. This step is that explanation. It is the conceptual 
or theoretical analysis of tire issue or problem, where theory is applied so 
as to shed light on the problem. 

Many students find this to be the hardest part of the research process 
indeed, some students explicidy omit this part of the project. This is a sig¬ 
nificant flaw in their research, since die “analysis” is the central part of e 
process. What does it mean to “apply theory to an issue or pioblem . The¬ 
orizing sounds abstract and difficult, but this step need not be. By defini¬ 
tion a theory is a simplified version of reality that allows die i esearcher 
more easily analyze the issue or problem. As a researcher, you ne j; d to as 
yourself: what are the essential concepts comprising the P r ° bl “» be 1 J 
searched? (The nonessential concepts can be ignored.) How ® 
sential concepts related? What do these relationships imply. The result o 
tliis process of analysis or conceptualizing is the research hypothesis. 

Probably the single most important determinant of how successful youi re¬ 
search will turn out is die hypothesis you work with. Recall drat the h YP oth 
sis is the proposed answer to the research question. This suggests a way to start. 

If tire research question is: 

What caused U.S. consumer spending during the 1990s to increase at a 

faster rate than income? 

A tentative hypothesis could be: 

U.S. consumer spending during the 1990s increased at a faster rate than 

income because. .. 

So how does one fill in the dots? How does one figure out a reasonable 
hypodiesis? One way is to derive die hypothesis from thco1 ^^^^ 
ideas derived from the literature. What economtc ffieory or iheones ar 
applicable to tire current problem? There is a well-developed theory o 
consumer spending. What does dris consumption theory suggest affec 
consumer spending in addition to income? Theory suggest;> that house- 
hold wealth and interest rates influence consumer spend g_ 
those factors change during the 1990s? Interest rates were ^ Cesit 
over the last half of the decade; stock markets, however, b^LDo«* 
appear that one or more factors can explain the change m consum 
spending? If so, that could be your hypothesis: 

Hi 
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U.S. consumer spending during the 1990s increased at a faster rate than 
income because of the increase in household wealth caused by booming 
stock markets. 

We will discuss conceptualizing the model in more detail in Chapter 7. 

Step 4: Testing Your Analysis 

In terms of research methodology, there is a major divide between sci¬ 
ence and nonscience disciplines. Science is research that is empirically 
testable, at least in principle, while research in the nonscientific disciplines 
is not empirically testable. This is not to say that philosophy, literature, or 
any of the other humanities is less valuable or less valid than biology, chem¬ 
istry, or other sciences. All this implies is that the humanities use nonem- 
pirical methods to validate their scholarship. 7 

Economics as a social science uses a version of the scientific method in re¬ 
search. 8 What does it mean to test a theory empirically? The researcher asks 
the question, If the theory is correct, what evidence should there be of it in 
the real world? For example, if the theory of supply is valid, then if the price 
of some product rises, producers should supply greater quantities of output. 
In other words, testing a theory means comparing the implications or pre¬ 
dictions of the theory with appropriate real-world evidence. The data are 
collected and analyzed. If the theory fails, then it is modified and retested, 
or it is rejected. If the theory does not fail, it is provisionally accepted. 

Take, for example, what happened to unemployment during the Great 
Depression. By 1933, the unemployment fate had increased to 25 percent. 
What did the existing theory suggest? The microeconomic theory of labor 
markets predicted that when faced with a decrease in demand, unemploy¬ 
ment would be only temporary until real wages fell. Though nominal wages 
fell, so did the price level. As a result, it is not clear whether real wages fell 
or not, but unemployment remained above 10 percent until 1942! Since 
the prediction of tire theory (only temporary unemployment until real 
wages fell) was contradicted by the real-world evidence (significant unem¬ 
ployment for over a decade), the theory was shown to be inadequate. 

For this step in a research project, there are really several elements. The 
first is deciding how you will go about assessing the validity of the hypoth¬ 
esis. This is an important part of designing or planning the research. Re¬ 
search design has two parts. The first is locating a good data set, one that 
is large, rich, and accurate enough to adequately test your hypothesis. We 
noted earlier that failure to find a good data set would compromise your 
ability to complete your research successfully. For this reason, it is impor¬ 
tant to begin looking for data early in the planning process. A detailed ex¬ 
planation on data sources will be found in Chapters 8 and 9. 
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Summary 


The second part of research design is selecting a statistical method to 
adequately test your hypothesis. A key question is the following: if the 
statistical test you employ yields the best possible results, how confident 
can you be that your hypothesis is confirmed? In other words, will the 
test adequately discriminate between your hypothesis and alternatives? 
If not, then you should consider a more powerful test. Oftentimes, how¬ 
ever, lack of adequate data will limit your ability to use a more powerful 
test. This underscores again the importance of finding a good data set. 
A discussion of statistical methods for testing hypotheses will be found 
in Chapter 10. 

Once you have collected your data and settled on an appropriate statis¬ 
tical test, tire next element is doing that assessment. This typically requires 
no more than running a computer program. 

Step 5: Interpreting the Results and Drawing Conclusions 

This step is closely related to the previous one, but it is sufficiently im¬ 
portant that it needs to be emphasized separately. Ethridge (1995, 29) ar¬ 
gues that this step “is the one most often overlooked or underestimated by 
students as they are being initiated to the research process. They often see 
the empirical computer results as tire end product of the process. Actually, 
the interpretation of those results may be more challenging, and more im¬ 
portant, than the process of obtaining the numbers.” 

What are the results of the empirical testing of the research hypothesis? 
Are they consistent with the predictions of the theory? Are there any prob¬ 
lems in tire testing protocol that need to be corrected to obtain valid re¬ 
sults? What are the shortcomings of the testing methodology that limit or 
weaken the results? In light of the answers to these questions, what can be 
concluded about the results? To what extent do they support the hypoth¬ 
esis? How do the results compare with those of earlier studies? More gen¬ 
erally, what can be concluded about your analysis and about the research 
question more broadly? 

To be sure, there are numerous additional details to consider when in¬ 
terpreting statistical results. These details will be discussed in Chapter 11. 

We need to speak to one final but often misunderstood point. What is 
good research? Good research is research that follows the scientific 
method, wherever the results lead, even if they reject the hypothesis. Recall 
that research is tire search for valid knowledge. Ethridge (1995) observes 
that honest research is open minded; looking for a preconceived conclu¬ 
sion is not honest research. A research project that rejects the hypothesis is 
not a failed project because it still advances our knowledge—in this case, by 
eliminating one hypothesis as an explanation for the problem. 9 



Step 6: Communicating the Findings of the Research Project 

The final step in the research process, and in many ways the most im¬ 
portant one, is communicating the results. This is how knowledge pro¬ 
gresses. The communication begins -with the researcher’s written report 
of the research. This is where the researcher makes a case for the valid¬ 
ity of his or her hypothesis based on the logic and empirical evidence of 
the research. At one level, the research paper really does no more than 
explain what the researcher discovered from each of the steps in the re¬ 
search process. It is important to remember, however, that the research 
does not stand on its own; in other words, the evidence is not self- 
evident. Rather, the purpose of the research report is to present a con¬ 
vincing answer to the research question, in a way that persuades other 
researchers in the field. 

Again, like everything else in the research process, the research report 
is done in several stages. Typically, a draft is written. The researcher asks 
colleagues to review it and provide suggestions for improvement. 

The researcher may present an oral version of the paper at a profes¬ 
sional conference, again to receive comments from other experts. For ex¬ 
ample, economics students have the opportunity to present their research 
at a number of venues, including the annual meetings of the Virginia So¬ 
cial Science Association, the Eastern Economic Association, and the Na¬ 
tional Conference on Undergraduate Research. After each of these stages, 
the researcher’s paper is revised to incorporate what has been learned 
from the feedback received. 

Finally, the paper is submitted to a journal for publication. For exam¬ 
ple, several journals exist to publish undergraduate economics research. 
These include Issues in Political Economy and University Avenue Undergradu¬ 
ate Journal of Economics. Usually, the journal editors ask for still more revi¬ 
sions before a researcher’s paper is published. 


SUMMARY 

The research process in economics has six major steps: 

1. Developing an Effective Research Question 

2. Surveying the Literature 

3. Analyzing the Problem 

4. Testing the Analysis 

5. Interpreting the Results and Drawing Conclusions 

6. Communicating the Findings of the Research Project 
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MOTES 

1. Ethridge (1995) makes the same pointwhen he observes that there are 
six ways to gain knowledge: through the senses, experience, intuition, 
revelation, measurement, and reasoning. Of these, only the last two 
are sources of reliable data. 

2. This book is designed to be used this way too. Though laid out se¬ 
quentially, it includes many references backward and forward. 

3. Note that the hypothesis of your research will become the thesis of 
your research paper. 

4. Note that a research problem is not necessarily a social or other prob¬ 
lem in the real world. For further elaboration of this point, see Booth 
et al. (1995), especially Chapter 4. Bean (1996, 30) describes how 
scholars identify a research problem: “Expert[s] feel an uncertainty, 
doubt a theory, note a piece of unexplained data, puzzle over an ob¬ 
servation, confront a view that seems mistaken, or otherwise articulate 
a question or problem.” 

5. On a project such as an honor’s or master’s thesis or a doctoral disser¬ 
tation, it is critical to choose a research advisor whose interests are in 
your topic area; otherwise, they will find it difficult to give you the com¬ 
mitment you need. 

6. Members of a research community can be identified by reviewing the 
published literature. 

7. Missimer (1995) describes these methods as “speculation.” This term 
is not meant to be dismissive; it merely means a type of reflection sim¬ 
ilar to the conceptualizing that occurs in the theoretical sciences, 
where validity is based on logic and normative standards, but not em¬ 
pirical evidence. 

8. We will discuss the difference between the social and physical sciences 
in Chapter 10 on empirical methods. 

9. This is also consistent with Blaug’s (1992) doctrine of falsification. 

10. Another analogy that might make sense to undergraduates is your 
plan for completing degree requirements. 

11. In a class setting, the audience consists of your classmates and the in¬ 
structor; in a professional setting, the audience may consist of a fund¬ 
ing agency. 
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SUGGESTIONS FOR FURTHER READING 

Booth et al, (1995), Chapters 2-4 —Excellent discussion of what makes a good 
research question. 

Ethridge (1995) —Readable guide to research methodology in economics. 
See especially Chapters 5-9 on designing and carrying out a research 
project. 

Ramanathan (1995), Chapter 14 —Good summary of the major steps in¬ 
volved in completing an empirical research project, from choosing a 
topic to writing the report. 

Siegfried et al. (1991), the section titled “Purpose of the Economics Major,” 
pp. 199-202 —Concise but complete statement of the research method¬ 
ology of economists, which they label the “economic way of thinking.” 

Wyrick (1994) —Guide to economic research for undergraduates. Published 
just before the Internet revolution, but much of the book is useful. See, 
especially, Chapters 9-12. 

EXERCISES 

1. Consider the following sample research questions. Using the criteria 
discussed in this chapter for a good research question, determine which 
are good research questions and which are poor research questions. Ex¬ 
plain your reasoning. 

a. Why is the supply of hospitals low in certain areas, and how does this 
affect the quality and price of the hospitals and doctors in these areas? 

b. Why do consumers buy over-the-counter dietary supplements that 
are said to promote weight loss despite the health risks, and what fac¬ 
tors influence these purchases? 

c. What were the causes of the Great Depression? 

d. What is the effect of foreign direct investment on a nation’s eco¬ 
nomic growth rate? 

2. Design a good research question. Explain how it meets the criteria for 
a good research question. 

3. Write a complete research proposal, following the model described in 
Appendix 2A. You will likely have to read ahead in this book to com¬ 
plete a good proposal. 
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Writing the Research Proposal 


A successful research project requires planning. This is particularly true 
for inexperienced researchers, who often find planning the project diffi¬ 
cult. This difficulty is all the more reason to spend time and effort on it. 

Your research plan should be flexible. You can’t expect the project to 
go exacdy according to the plan, but without a plan it’s hard to know 
where you will end up. Think of it as the blueprints for a house you are 
building. 10 It is not that you can’t or won’t make changes as you go along, 
but you don’t want to start the project without a plan. To develop a plan, 
a scholar must think about the research process. 

The research plan becomes tangible in the form of the research pro¬ 
posal. A proposal is an exercise in persuasion, whereby you seek to con¬ 
vince your audience that you can complete a meaningful and interesting 
piece of research on the topic you have selected. 11 In the proposal, you 
sketch out in detail how you propose to do that. Thus, the proposal de¬ 
scribes the research plan. Ethridge (1995, 85 and 89) points out that “The 
more thorough and complete the proposal, the clearer and more com¬ 
plete the plan” and also that “The proposal is evidence of the quality of 
thought that has gone into the project, and the whole research plan . . . 
will be judged by the quality of the project proposal.” 

A research proposal should have five components: 

1. Statement of the Nature of the Problem: What is the issue or problem 
this research will address? Why is it interesting, significant, and 
amenable to economic analysis? 

2. The Research Question: What is the question that is the focus of the 
research? 

3. Survey of the Literature: Outline the major studies in the literature that 
have a bearing on the research question. Explain clearly how they are 
relevant to your research question. (For a more extensive discussion of 
the written literature survey, see Chapter 12.) 

4. Research Design: What is the analytical framework of the model? For 
example, “This research will apply the theory of demand to study con¬ 
sumer spending on Hula-Hoops.” Note that the research hypothesis 
should follow from this conceptual analysis. 


Appendix 2A 


In an ideal world, what data would you need to test your hypothesis? For 
example, sales of Hula-Hoops, their price, personal disposable income, 
some measure of their appeal to consumers. 

What data have you found to test your analysis, and what is tire source 
of that data? For example, “I plan to test the hypothesis using data on 
Hula-Hoop sales and prices, as well as disposable income obtained from 
the National Income and Product Accounts, Survey of Personal Con¬ 
sumption Expenditures. I was unable to find a measure of coolness.” 

How will you test your analysis empirically? For example, “This research 
will conduct a regression study over tire time period of 1972-1999.” Ex¬ 
plain how this design should produce reliable and valid results. 


(Detailed explanation of these topics will be found in Chapters 7, 8, 9, 
10, and 11, respectively.) 

5. References: List all the references you have read so far and plan to use in 
your paper. Do not list references you have not read or references that 
provide background information but do not have a clear bearing on 
your research question. Use the appropriate bibliographic style. 

In sum, the proposal should answer four questions. What is the issue or 
pi oblem your research will investigate? What specific question will you at¬ 
tempt to answer? How will you attempt to answer it? Do you have adequate 
sources to do the job? If the proposal does not convince the reader that it 
answers all four questions sufficiently, it is not an acceptable proposal. 

If you have a good research proposal, completing the research project 
will be straightforward—you just need to follow the plan. Even if the plan 
changes, the project should be doable with a minimum of problems. If you 
have an incomplete or otherwise poor proposal, it is a signal that the proj¬ 
ect will be difficult or impossible to accomplish successfully. 




SRifim'm a p p e n a i x 


lQ|5Ef 


Questions for Evaluating 
a Research Proposal 


We observed above that a key purpose of a research proposal is to con¬ 
vince tire reader that the researcher can complete a solid piece of research 
on an important topic. When you evaluate a research proposal you should 
consider the following questions and issues: 

• Does the proposal explain die problem or issue sufficiendy to make it in¬ 
teresting and wordiy of a reader’s time? Why or why not? Is there any¬ 
thing in the introductory section of the proposal that ought to be 
explained in more detail? 

• Do you understand the research question? In other words, can you ex¬ 
plain what the research will attempt to do or prove? Is there a clear state¬ 
ment of this? If so, identify it. If not, say so. 

® Does the author appear to have identified the major studies previously 
done on tire topic? Has he or she identified those studies’ contributions 
and shortcomings? (The proposed research should build on the contri¬ 
butions and/or correct one or more of die shortcomings.) 

° Do you understand clearly and exactly what tire author intends to do to 
complete this research? Are there any details missing from die descrip¬ 
tion? If so, what are they? Has the author identified a reasonable data set 
with which to test his or her hypothesis? Has he or she adequately ex¬ 
plained his or her testing methodology? 

• Are there enough sources to write a convincing paper? 

• In sum, what do you see as the proposal’s strengths, and what do you see 
as its weaknesses? 

Your commentary should conclude with an explicit statement about 
whether the proposal is acceptable as is or not. If it is not, you should pro¬ 
vide specific guidelines for what needs to be done to make the proposal 
acceptable. Again, remember that your feedback should be supportive but 
honest. 
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Surveying the Literature 
on a Topic in Economics 


'If I have seen further, it is by standing 
on the shoulders of Giants ." 


SIR ISAAC NEWTON 
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arli6r we noted * at scholars engage in a kind of conversation or di¬ 
ll a l°g of competing arguments as they attempt to increase hu- 
F knowledge about a topic. This conversation manifests 

.ifselfas.pubhshed research, which is described as “the literature” on 

the subject. When you survey the literature on a subject, you are trying to 
identify the major studies that have been published to date. But more im¬ 
portantly, you are trying to understand what the studies say and how they 
relate to one another. We will explain how to read and understand schol¬ 
arly publications in Chapter 6. In this chapter we will explain why and how 
researchers perform a literature survey as a key component of the research 

process. In Chapter 12, we will explain how to write a literature survey as 
part of a research paper. 

Why Is a Literature Survey Necessary? 

Ethridge (1995, 115) observed that before you can “advance the state 
of knowledge,” you need to know what the state of knowledge is. So when 
you survey the literature, you are trying to create your own sense of what 
is known and what is not known about the subject. This is important for 
several reasons. Early in the research process you can get ideas for possi¬ 
ble angles for your research. Later in the process you can show how your 
research fits into and contributes to the larger conversation on the subject. 
This serves to establish your credentials as a researcher by showing read¬ 
ers that you are knowledgeable about the field. 
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There are several additional reasons for surveying the literature. One is 
to avoid duplicating findings drat have already been obtained. In dris fash¬ 
ion you can use someone else’s prior work as a steppingstone for yours. For 
example, Ethridge (1995) points out that the literature can help you de¬ 
sign your own study by showing how previous approaches eidier were or 
were not successful. Similarly, previous studies can suggest ways you might 
handle problems you run into. 

Those new to research sometimes focus dreir literature survey too nar¬ 
rowly. The literature review should not be limited to searching for previ¬ 
ous studies on your topic. It should also examine studies of other topics drat 
use conceptual frameworks or testing methodologies that might be ap¬ 
plicable to your topic. For example, in a research project on the demand 
for blood, it could be useful to examine previous demand studies on re¬ 
lated products, such as tire demand for transplant organs or even on other 
commodities entirely. 

Where to Search: Popular versus Scholarly Literature 

Information on a topic can be found in two types of publications: pop¬ 
ular* and scholarly publications. Scholarly publications are also-referred to 
as scientific or professional publications. What is the difference between 
popular and scholarly publications? Popular publications are addressed to 
a general audience, while scholarly ones are addressed to a specialized au¬ 
dience, namely, experts in the field. Scholarly publications are often pri¬ 
mary sources of information, while popular publications are almost always 
secondary sources. A primary source of information is the original publi¬ 
cation of a research study, written by an expert and addressed to odrer 
scholars in die field. One example would be an article in the American Eco¬ 
nomic Review (AER, a scholarly journal) tiiat explains a new study con¬ 
ducted by die authors of die article on the effects of preschool education 
on poverty rates. A secondary source is a report based on the primary 
source, typically addressed to a more general audience. An article in Time 
magazine summarizing the article in the American Economic Review would 
be an example of a secondary source in a popular publication. 

One popular publication tiiat undergraduates sometimes mistake for a 
scholarly journal is the Economist. This magazine publishes a regular col¬ 
umn entided “Economics Focus” tiiat does an excellent job of summariz¬ 
ing recent research in language that the educated lay public can 
understand. In addition, unlike most popular publications, the Economist 
often prorides references to the scholarly articles in which die research it 
reports on was published. Nonetheless, the Economist is a popular rather 
than a scholarly journal. 

i® ' 



___ . Where to Search: Pop ular versus Scholarly Literature 

To begin die literature survey process, it makes sense to consult popu¬ 
lar sources because they trill give you information on what is generally 
known about a topic. In addition, since tiiey are addressed to general au¬ 
diences they are easy to read and assimilate. Popular sources include en¬ 
cyclopedias; news magazines, such as Time, Newsweek, and Business Week 
and newspapers such as die Wall Street Journal or the New York Times. Note 
that these popular publications are increasingly available online, often for 
roe. For example, the most recent two weeks of die Washington Post are 
available at tvww.washiiigtonpost.com. 

Internet resources such as directories and search engines are also pop¬ 
ular sources of general information. Most undergraduates are familiar 
witii diese tools, so we trill only discuss them briefly here. Directories are 
hierarchically organized catalogs on a variety of subjects. Well-known ex¬ 
amples include http://www.galaxy.com, http://Mw.looksmart.com, and 
http://www.yahoo.com. Virtual libraries are directories tiiat have been 
compiled by librarians and other experts. As a result, they tend to include 
somewhat better resources than ordinary directories. Examples include 
the WWW Vntual Library (http://www.vlib.org) and the Internet Public 
ibrary (http://www.ipl.org). Each of diese has sections on economics 
tiiat can provide helpful background information. 

Another Internet tool is tire search engine. Search engines are com- 
putei programs that create databases of websites catalogued by topic. 
Users can perfoim keyword searches on those databases to obtain infor¬ 
mation from the websites dealing with their subjects of interest. Popular 

Sf Tu!!w ieS mdUde htt P : //™-g«ogle.com, http://ivwwmxcite.com, 
and http://www.altavista.com. A particularly useful search engine is 
MetaCrawler at http://wivw.go2net.com. Instead of searching its own data- 
base, MetaCrawler searches die results of several major search engines, in- 
cludmg Google, Yahoo, Ask Jeeves, About, LookSmart, and others. 1 

I he shortcoming of most general-interest directories and search en¬ 
gines is that tiiey contain a great deal of information tiiat is not useful for 
scholarly research. This is especially problematic for norice researchers 
who may have difficulty differentiating between authoritative and nonau- 
thontative sources. We will discuss some ways in which these tools may be 
productively used later in this chapter. 

Although it makes sense to begin reviewing die literature by consulting 
popular sources, it is important to realize tiiat the conversation between 
experts m the field that creates knowledge occurs only in the scholarly lit- 
ei ature - These Professional publications have undergone a peer referee or 
review process prior to publication. This review conveys at least a certain 
degree of validity to these studies. Scientific publications include articles 
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Examples of Scholarly Economics Journals _ 

National Journals: American Economic Review 

Journal of Political Economy 
Quarterly Journal of Economics 

: Econometrics /^ 

Review of Economics and Statistics > . 
Review of Economic Studies 

Regional Journals: Economic Inquiry : 

* Southern Economic Journal 

Eastern Economic Journal 

Specialized Journals: Journal of Urban Economics i; 

Journal of Money, Credit and Banking 
1 i Journal of Industrial Economics , : 


in professional journals, scholarly books, monographs, working papers, 
and some government documents. Monographs are short books, typically 
no more than one hundred pages, on a single (hence “mono”) scholarly 
topic. Working papers are scholarly articles in progress—prior to final 
publication. Table 3.1 gives examples of scholarly economics journals pub¬ 
lished in the United States. 1 

Many professional journals are difficult for undergraduates to under¬ 
stand, and as we noted earlier, Chapter 6 will explain how to decipher 
them. You should be aware that some journals are more readable than oth¬ 
ers. 2 For example, each regional Federal Reserve Bank publishes an Eco¬ 
nomic Review. Publications of the Congressional Budget Office are also 
quite good. In addition, publications of the various think tanks such as the 
Brookings Institution, the American Enterprise Institute, the Economic 
Policy Institute, and the Cato Institute tend to be more accessible. How¬ 
ever, it is important to remember that think tanks are in business to pro¬ 
mote their particular point of view. Thus, one should read their 
publications more carefully and critically than more objective sources. 

How to Search: Developing an Effective 
Search Strategy 

To locate information efficiently, one needs to use a search strategy. 
This section will explain how to develop one by using a sample research 
question: “To what extent was the 2001 U.S. economic slowdown caused by 
the decline in the stock market?” 
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AEA/JEL/EconLit Subject Descriptors 


A - General Economics and Teaching 
B - Schools of Economic Thought and Methodology 
C - Mathematical and Quantitative Methods 
D,- Microeconomics 

E- Macroeconomics and Monetary Economics 
F - International Economics 
G - Financial Economics 
H —Public Economics ■ 

I - Health, Education, and Welfare ... 

J - Labor and Demographic Economics 

l< ~ Law and Economics V'T ■ Tv' i 

L- Industrial Organization v ,v : 

M - Business Administration and Business 

N - Economic History .; ww. \ T-T (T:!v.,'T-. 

,0 - Economic Development, Technological Change, and,Growth 

P - Economic Systems : 

Q -Agricultural and Natural Resource Economics 

i ' 

R-. Urban, Rural, and Regional Economics 

Z- Other Special Topics ■■ j:^ : : .: T T 


There are two general approaches to locating information. Ackermann 
and Hartman (1998) describe them as browsing and keyword searches. 
You can think of them as emphasizing human capital (or thinking) versus 
physical capital (or brute force). Neither is superior for all searches; rather 
a key task in developing a search strategy is to determine tire optimal com¬ 
bination of the two for a given search. 

Browsing 

Browsing means manually examining a document (e.g., a printed bibli¬ 
ography or an online directory) for useful information or references to use¬ 
ful information. Effective browsing requires you to think carefully and 
logically about how information about a discipline is organized. The Amer¬ 
ican Economic Association (AEA) uses a hierarchical system to classify sub¬ 
ject information in economics, an overview of which is shown in Table 3.2. 3 
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Expanded AEA Subject Descriptors 
for A - General Economics and Teaching 


A00 - General 


A1 - General Economics 
A10 - General 

A11 - Role of Economics; Role of Economists 
A12 - Relation of Economics to Other Disciplines 
A13 - Relation of Economies to Social Values 
A14 - Sociology of Economics 

' V V ; A19 - Other Y- v,; ' 

A2 - Teaching of Economics 
A20 - General 

A21 - Pre-college , v ' 

A22-Undergraduate 

A23 - Graduate /f] 

A29 - Other • " 


Each of these general headings can be further expanded, as illustrated in 
Table 3.3 for the category “A - General Economics and Teaching.” The 
complete classification system can be reviewed at http://www.aeaweb.org/ 
journal/elclasjn.html. By knowing this classification system, you may be able 
to find useful information on a research topic more easily. 4 

There are two journals you should be particularly aware of as you begin 
your research. The Journal of Economic Perspectives (JEP) is designed as a 
general-interest journal for economists. As such, the majority of articles it 
publishes are summaries of current thinking by economists on a given 
topic. Similarly, the Journal of Economic Litemture (JEL) primarily publishes 
survey articles of the literature on specific topics in economics. The Amer¬ 
ican Economic Association publishes both journals, and their tables of 
contents for the past several years are available at the AEA website 
(http://www.aeaweb.org/jep/contents and http://www.aeaweb.org/ 
journal/contents) , 5 If you can find an article on your research topic in ei- 
dier of these journals, you will be ahead of the game since they will almost 
certainly identify and discuss the relevant major studies as of the publica¬ 
tion date of the article. You will still need to look for more recent studies, 
however. 

More generally, when you locate a useful journal article, you should 
consider the sources it references as obvious targets for your review. Bear 
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in mind, however, that since the article’s author may not have pursued tire 
same research question as you, its references should be a complement to, 
not a substitute for, your own literature survey. In sum, browsing is more 
likely to give you useful information than will simple keyword searches, but 
your results Mil depend on the judgment of die author or editor of the 
document you browsed. 

Sample Browse Search 

Suppose you are researching the extent to which the recent U.S. 
economic slowdown was caused by the decline in the stock mar¬ 
ket. You are aware that the life-cycle model of consumption theory 
suggests that household wealth is an important determinant of con¬ 
sumer spending. A review of the contents of the Journal of Economic 
Perspectives reveals an article by James Poterba titled "Stock Market 
Wealth and Consumption" in the spring 2000 issue. The article in¬ 
cludes two pages of references going back to Ando and Modigliani's 
classic 1963 paper on which the life-cycle model is based. 

Keyword Searching 

The alternative way to locate information is by performing a keyword 
search. Keyword searches use search engines on the World Wide Web or 
on specialized databases. These latter include bibliographic databases, 
such as EconLit (formerly the Economics Literature Index) , which consist of 
citations and abstracts only, as well as full-text databases, such as Dowjones 
Interactive, which contain entire documents. Access to specialized data¬ 
bases is provided by a number of commercial vendors, such as OCLC First- 
Search or DIALOG. Researchers can obtain access to many of these 
databases through their university library or computer network. 

Keyword searching allows you to examine far more documents than 
would be feasible by browsing, even assuming you knew about them. The 
major web search engines claim to index all the documents on the Inter¬ 
net, while search engines in electronic databases index all the documents 
in tire database. 6 For example, EconLit currently indexes more than six 
hundred economics journals, seventeen hundred new books, and nine 
hundred new dissertations per year—for nearly a complete record of all 
English-language publications in economics since 1969. As such, EconLit 
should be one of the first places you start searching when researching in 
economics. 

One exceptional tool for keyword searching is the Social Science Citation 
Index (SSCI). The SSCI allows you to search for citations both backward and 
forward in time. In our earlier discussion of browsing, we noted that when 
you find a useful publication, say DeLoach (2001), it makes sense to review 
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| NOTES FOR NOVICE RESEARCHERS 


Do Wot Limit Your Search to Full-Text Databases! 

Undergraduates sometimes view the literature search as nothing 
more than a hurdle on the way to completing a research project. (This 
is not true—indeed, as we have tried to explain, discovering what 
other researchers have done in the field can be a major help in 
completing your own research.) As a consequence, such students may 
limit their search to full-text databases. After all, they reason, even if 
you find something in a bibliographic database, it’s still possible it 
would be.difficult or impossible to acquire. This is poor judgment! The 
goal of a literature search is to find the most important studies done on 
a topic, not merely the easiest ones to obtain. Put another way, if there 
is a major study that you fail to find because it was not cataloged on a 
full-text database, your research will have a substantial flaw . 7 Nearly all 
coEege and university libraries subscribe to the major economics 
journals. This means articles from those journals are available in 
printed form, microfiche, or film at your library. Even if your library 
does not subscribe to a journal that you need, it's possible that one of 
your faculty members does. It never hurts to ask. Limiting yourself to 
fuE-text databases wiE also virtuaEy rule out any books as sources. 
FinaEy, it is hard to imagine a reference that you could not obtain 
through interlibrary loan. In short, don’t limit yourself to fuE-text 
databases! It’s false economy—it wiE cost you in the end more than 
it’s worth. 


the earliei studies that it cites. SSCI can do this for you electronically, since 
fox every journal article or book it catalogs, it also indexes the pubhca- 
tion s citations. Doing this electronically can be a bit faster than reading a 
study’s bibliography, but the real advantage of SSCI is in doing forward 
seaiches. That is, the SSCI database also indexes all die publications that 
cite DeLoach (2001) in their reference lists. As a result, it is very easy to get 
a complete list of the major studies done on a topic. 

An additional advantage of the SSCI is that it is interdisciplinary; it cov¬ 
ers a variety of social science fields. What this means is that you are less 
likely to miss a study done on your topic but in a field other than eco¬ 
nomics, as could occur if you only searched EconLit. The main disadvan¬ 



tage of the SSCI is that it is quite expensive, so many colleges and univer¬ 
sities do not subscribe to it. If your institution has it, you should use it. 

Though your scholarly literature survey should concentrate on using 
specialized databases to find literature, web search engines can also be 
profitably employed. The best web search engine for academic uses is 
probably Google (http://www.google.com) . A criticism of Web-based re¬ 
search is that the majority of what is on the Web is not scholarly in nature. 
One advantage of Google is that in addition to indexing webpages, it also 
catalogs a number of other file types, including Adobe document (.pdf) 
files, Word and WordPerfect documents, Excel and Lotus 123 spread¬ 
sheets, PowerPoint presentations, and others. As a result, a researcher 
could use Google to find scholarly working papers or data sets on re¬ 
searchers’ personal webpages, even before they are published. 

The principal disadvantage of keyword searches, when using special¬ 
ized directories or web search engines, is that they may generate many 
“hits,” but only a few useful ones. Imagine doing an electronic search us¬ 
ing the phrase stock market. Depending on the database, you could get hun¬ 
dreds or even thousands of results. Even if there are many relevant hits, 
you will need a great deal of time and effort to separate the wheat from the 
chaff. Consider the way commercial fishermen use a dragnet. The net 
scoops up everything in the water for several miles around. Then even 
though the fishermen only want codfish, they need to examine and throw 
away everything else they catch. 

The solution to this quandary is to perform a more advanced keyword 
search using a combination of Boolean and phrase searching. Boolean 
searching allows you to focus the search in the way most likely to obtain 
useful results. There are three Boolean operators: AND, OR, and NOT. 

AND is used to narrow a search. For example, a search on the key¬ 
words Keynesian AND Post would locate only those items including 
both keywords, such as “Post Keynesian.” 

OR is used to widen a search. A search on the keywords Monetarist 
OR Keynesian would locate those items including either keyword. 

NOT means exclude. It includes all items except those that contain 
the NOT keyword. For example, a search on the keywords Keyne¬ 
sian NOT Post would identify all items with “Keynesian,” except 
those with “Post Keynesian.” 

You can also fine-tune your search using nested Boolean logic. For ex¬ 
ample, a search on the keywords Monetarist OR ( Keynesian NOT Post) would 
locate all items including either “Monetarist” or “Keynesian,” except those 
with “Post Keynesian.” 
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Phrase searching looks for the items that contain an exact phrase that 
you specify within quotes. For example, a search on “Post Keynesian" would 
locate only those items with that exact phrase, but it would exclude items 
that include those two keywords separately. A wildcard character can allow 
you to search more efficiently by truncating your search phrase. If the wild¬ 
card character for a search engine is *, then searching on die keyword 
Keynes*would locate items containing “Keynes” or “Keynesian” or “Keyne¬ 
sians. Similarly, a search on die keyword wom*n would yield items con¬ 
taining “woman” or “women.” 

A Basic Search Strategy 

Ackerman and Hartman (1998) propose a basic search strategy, which is 
shown (in a revised form) in Table 3.4. Though die table gives the impres¬ 
sion that this is a straightforward process, in practice it is a great deal more 
uncertain. For one tiling, at the beginning of die process die researcher may 
not know exacdy what to look for. This is especially true for a researcher new 
to the topic. For another, die process is iterative. You will never find every¬ 
thing in a single search! Rather, you will need to Uy an initial set of keywords 
with one database. Once you find a few good sources, read them. As you be¬ 
come more familiar with the literature, other ideas for keywords will come 


BBBBjl A Basic Search Strategy _ 

1. . Begin by stating your research topic or question; 

2. Identify important concepts relevant to your topic; • \ 

3. Brainstorm to create a list of keywords that describe these concepts; 

4. Determine any synonyms to these keywords; 

5. Choose a subject approach; that is, determine which discipline or 
disciplines are likely to have literature on your topic; for example) 

■ economics, law, public policy, business, sociology, education; 

6. Determine which search features may apply; for example. Boolean 

; operators, wildcards; ■ v 

7. Choose an appropriate database to search the given subject; for ex- 

■; ample, EconLit, . Yk ;k ;y yc 

8. Read the search instructions for the database; 

9. Create a search expression using the appropriate syntax; 

10. View the results; 

11. Modify the search if necessary (return to Step 2); 

12. Try the same search with another database (return to Step 6). 



to mind, which you can tiien search with. If one database is unsuccessful, uy 
anodier. If one subject approach doesn’t yield enough, uy a related field. 
There is an art to keyword searching, and some researchers are better tiian 
otiiers at it. Fortunately, anyone can become better witii practice. 

Sample Keyword Search 

Let’s perform a keyword search on our sample research topic using die 
search suategy oudined in Table 3.4. 

1. Recall that die research topic is the extent to which the 2001 eco¬ 
nomic slowdown was caused by a decline in the stock market. 

2. The research topic can suggest die important concepts to begin with. 
These could include stock market decline and economic slowdown. 

3. A litde brainstorming about possible connections between the two 
can suggest other keywords to search for: household wealth, con¬ 
sumer spending, and life-cycle model. Note also that in choosing 
keywords, one can select the JEL subject classification for one’s 
topic. In diis case, we could choose G100 for “General Financial 
Markets” or E200 for “Macroeconomics: Consumption, Saving, Pro¬ 
duction, Employment, and Investment, General.” 

4. Synonyms for (stock market) decline could be crash or correction or 
bear market. A synonym for (economic) slowdown could be recession. 
Synonyms for wealth could be savings or saving. Synonyms for con¬ 
sumer spending could be consumer expenditure or consumption. 

5. The research topic suggests two fields whose literature might prove 
useful: economics and business. 

6. For multiple-word expressions we will need to use phrase searching, 
for example, “ consumer spending" For synonyms we will need to use 
Boolean operators, for example, “ consumer spending' OR “ consumer 
expenditure ” OR “ consumption .” For relationships we will need to use 
the Boolean AND operator, for example, “consumption AND wealth." 

7. When performing economics research, a good database to begin 
with is EconLit. For other fields, die following might be helpful: 

• Expanded Academic ASAP- for business academic journals, 

• Wilson Business Abstracts - for business academic journals, 

• Business & Company Resource Center- for business information at 
the company level, 

• ERIC - for education, 

e PAIS - for public affairs and international studies, 

• Dow Jones Interactive - for current topics not yet in the scholarly lit¬ 
erature, also business academic journals, 

• SSCI- for any of die social sciences. 
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Be aware that the user interface and the format in which search re¬ 
sults are provided can differ depending on which vendor supplies 
your access to the database. 

8. The search instructions for EconLit are available at http://www 
.econlit.org/econlit/hints.html. 8 

9. An initial keyword search was performed using the expression “con¬ 
sumption AND wealth." 

10. This search yielded 1,422 hits, which is too many to review mean¬ 
ingfully. 

11. A revised search using the expression “consumption AND stock market" 
yielded 97 hits, including a promising article by Shirvani & Wilbratte 
(2000), “Does Consumption Respond More Strongly to Stock Mar¬ 
ket Declines than to Increases?” A subsequent search using the ex¬ 
pression “consumer spending (OR “ consumption”) AND recession” 
resulted in 78 hits, including another good one, Blanchard (1993), 
“Consumption and the Recession of 1990-91.” Another search using 
the expression “stock market AND recession” yielded 13 hits of which 
three look useful. 

12. Let’s try searching another database: Expanded Academic ASAP. A 
search using the expression “consumer spending (OR “consumption") 
AND recession" yielded 22 hits, of which several appear interesting. 
One of these was Blanchard (1993), which we found previously. 

You should try all the search expressions that generated good results 
(i.e., not too many hits, and many of them useful) on each of the databases 
searched. At some point, you will begin to turn up the same sources, like 
Blanchard (1993). This is a sign that you’re near the end of effective 
searching. Notice that our keyword searches did not turn up the Poterba 
(2000) paper we discovered earlier by browsing. That is why both types of 
searches are necessary. After completing the browsing and keyword 
searches, and examining the references cited by the studies found there, 
we can conclude that we have found the major published studies to date 
on our research question. 

Obtaining the Resources 

Once you have identified potential sources, you need to obtain them. 
The World Wide Web has revolutionized access to research materials, es¬ 
pecially for scholars at smaller institutions whose libraries may have only 
limited collections. More and more resources are available online, includ¬ 
ing full-text books and journal articles. 

lib 


Obtaining the Resources 


More dian a hundred economics journals are available free in full-text 
format. One list of such journals is available at tire Resources for Economists 
webpage at http://rfe.org. Another list is maintained at the College of 
Wooster (http://www.wooster.edu/economics/arcliive/journals.hmi]). 

There are also a number of proprietary portals to full-text journals. For 
example, the American Economic Association hosts a web portal 
(http://www.aeaweb.org) that includes full-text access to tire past three 
years of tire journals it publishes: tire AER, JEL, and JEP. Access to tire jour¬ 
nals’ tables of contents and abstracts is available to anyone, but full-text ac¬ 
cess requires AEA membership. The site also includes Bill Goffe’s Resources 
for Economists on the Internet, which will be discussed in detail in Chapter 9. 
JSTOR is an archive of full-text scholarlyjournals, including (as of this writ¬ 
ing) twenty-four top journals in economics, which are listed in Table 3.5. 9 
Many colleges and universities subscribe to JSTOR. AEA members can ac¬ 
cess JSTOR via a modest annual fee ($10). There are several other pro¬ 
prietary' online databases that offer full-text journal articles. One of these 
is Expanded Academic ASAP, which includes the full text of many economics 
journals that are not included .in JSTOR. 

In addition to the obvious speed and convenience that online access to 
journal articles provides, in the near future we will be able to link directly 
to references cited in those articles (backward linking). It may even be pos¬ 
sible to link to articles citing those articles (forward linking). 

Government documents are also increasingly available over tire Inter¬ 
net. In the past decade tire U.S. government has made a significant effort 

Economics Journals Available Through JSTOR 


American Economic Review 
Brookings Papers on 
Economic Activity 
Canadian Journal of Economics 
Econometrica ' 

Economica a / ' 

Economic History Review 
: Economic Journal 
International Economic Review 
Journal of Applied Econometrics 
■ Journal of Economic History ' 
Journal of Economic Literature 


Journal of Economic Perspectives 
Journal of Finance 
Journal of Human Resources 
Journal of Industrial Economics 
Journal of Money, Credit, and 
, Banking f:r. yA 
Journal of Political Economy 
Oxford Economic Papers 
Quarterly Journal of Economics 
Rand Journal of Economics 
Review of Economics and Statistics 
Review of Economic Studies 


to make virtually all print documents accessible online. Moreover unlike 
their printed counterparts, they are obtainable for free. A good point to 
begin searching for government publications is at www.firstgov.gov. 

The fix st place to look for books is still your college or university library 
To date, the availability of books online is lagging behind the availability 
of journal articles and government documents. One very promising 
source for electronic books, however, is NetLibrary (www.netlibrary.com), 
a subsidiary of On-line Computer Library Center (OCLC). NetLibrary of- 
feis electronic texts on a subscription basis to libraries. As of this writing, 
NetLibrary has over fifty-four thousand titles in a variety of fields includ¬ 
ing economics. Just as you would search for a printed book, you use your 
college or university library’s search engine to find an online text. Indeed, 

! until you find the entry, you won’t know whether the book is a hard copy 
or an ^electronic one. Once you locate the electronic copy, you can “check 
it out” just as you would a printed copy. Unless your library subscribes to 
more than one copy of the e-book, no one else from your institution has 
access to the book until you relinquish it. An additional benefit is that you 
can obtain the text twenty-four hours a day without having to physically go 
to the library to pick it up. ' ° 

We noted earlier that libraries at smaller institutions may have only lim¬ 
ited collections. There are several ways to get around this, such as making 
use of interlibrary loan services or visiting nearby research university li- 
biaries. For example, it is increasingly possible to search such library cata¬ 
logs remotely to ensure that their collection has what you are looking for 
before visiting the library. 

Thei e are also literally tens of thousands of economics working papers avail¬ 
able online from the National Bureau of Economic Research (www.nber.org), 
the Social Science Research Network (http://papeta.ssm.com), the Working 
Papers Archive at Washington University in St. Louis (http://econwpa.wusd. 
edu), and NetEc (http://netec.wusd.edu/NetEc.html). 

SUMMARY 

* In order to Perform original research, one needs to determine what is 
cunently known on a topic. Thus, at the beginning of any research pro¬ 
ject the researcher needs to survey the “literature.” 

" There are two sources of information on a topic: popular and scholarly. 
Only the latter is considered part of the literature. 

An effective search strategy includes the following steps: 


Notes 


8 Start with secondary sources in the general topic area. 

8 Search for survey articles on the topic, for example, in JEP or JEL, 

8 Move to primary sources: for example, EconLit or Social Science Citation 
Index 

8 Use a combination of keyword searching and browsing from the ref¬ 
erences of the useful sources you’ve located. 


NOTES 

1. Appendix 10 of Wyrick (1994) provides an extensive listing of eco¬ 
nomics journals grouped by subject area. 

2. Note also that working papers tend to be more readable than pub¬ 
lished articles because they usually include many more details of the 
argument details that are eliminated during the editorial process. 

3. This is also referred to as the EconLit or Journal of Economic Literature 
classification system. 

4. For example, you can do online searches using these classifications as 
keywords. 

5. The AEA website lists the tables of contents for the Journal of Economic 
Literature back to December 1994 and for the Journal of Economic Per¬ 
spectives back to the beginning of 1998. One can easily get the contents 
from earlier issues of the journals by searching EconLit, using the jour¬ 
nal name as the “Source” keyword. 

6. Note also that you should always use multiple web search engines since 
they use different search criteria and their databases, while overlap¬ 
ping, are not identical. 

7. In my courses, such a flaw would cost at least a letter grade on the fi¬ 
nal paper. 

8. Note that databases like EconLit are available through more than one 
interface. For example, at my institution, we get it from OCLC First- 
Search, a database service provider that provides access to forty-four 
databases on a wide variety of subjects. 

9. Note, however, that articles do not become available through JSTOR 
until five years after original publication. 

10. Note that Turabian’s Chapter 8 was published before the 1999 MT.A 
decision to recommend parenthetical references. 
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Exercises 


SUGGESTIONS FOR FURTHER READING 

Ackermann and Hartman (1998)— Excellent, user-friendly guide to online 
searches and the Internet as a research tool. 

Turabian (1996 )—Definitive guide to wiring and citation styles for re¬ 
search papers and theses. See also Appendix 3A. 

EXERCISES 

1. Select a research question, preferably one you are actually researching. 
Write dowi the research question. Review the tables of contents for tire 
Journal of Economic Perspectives and the Journal of Economic Literature 
(provided at http://www.aeaweb.org/jep/contents and http://www. 
aeaweb.org/jonrnal/contents) . Browse those tables of contents for one 
or more survey articles that are relevant to your research question. (You 
should identify at least one article that is relevant to your research.) 
Write down the complete bibliographic citation information for the sur¬ 
vey articles you discover, using the citation style your instructor favors. 
(See Appendix 3A for information on citation styles.) 

2. Select a research question, preferably one you are actually researching. 
Write dowr the research question. Use tire basic search strategy illus¬ 
trated in Table 3.4 to perform a keyword search in EconLit. Write down 
a list of important concepts” drat are relevant to your research ques¬ 
tion. Select and write down appropriate keywords. Create a search ex¬ 
pression, and execute the search. Modify dre search as necessary to find 
at least five useful hits for your research. Write down the complete 
bibliographic citation information for the studies you discover, using 
the citation style your instructor favors. 

3. Select a research question, preferably one you are actually researching. 
Write down dre research question. Use the basic search strategy illus¬ 
trated in Table 3.4 to perform a keyword search using a database other 
than EconLit. Write down a list of “important concepts” relevant to your 
research question. Select and write down appropriate keywords. Create 
a search expression, and execute die search. Modify the search as nec¬ 
essary to find at least five useful “hits” for your research. Write down the 
complete bibliographic citation information for the studies you dis¬ 
cover, using dre citation style your instructor favors. 

4. Select a research question, preferably one you are actually researching. 
Write down the research question. Use the basic search strategy illus¬ 
trated in Table 3.4 to perform a keyword search using a web search en¬ 
gine like Google. Write down a list of “important concepts” relevant to 



your research question. Select and write down appropriate keywords. 
Create a search expression, and execute the search. Modify the search 
as necessary to find at least five scholarly “hits” for your research. Write 
down the complete bibliographic citation information for the studies 
you discover, using the citation style your instructor favors. 




PW W- appendix 

listMi-- 


Scholarly References 
and Citation Styles 


An essential part of reviewing literature is keeping careful references 
for each scholarly work you use. The purpose of citation styles is to provide 
a standardized, concise way of doing that. References are important for 
two reasons: first, to give credit to previous authors for their ideas, their in¬ 
tellectual property, and, second, to help readers track down those ideas in 
full detail. We will expand on these reasons later in Chapters 5 and 6. Here 
we introduce the styles. 

There are three general styles for citations and references: they are 
known as MIA, the Chicago Style, and APA. Each of diese styles explains 
its preferred way to write scholarly notes in tire text (e.g., foot- or end- 
notes) and scholarly references at the end of the text. All three styles now 
recommend using the parenthetical form for references—for example, 
“Turabian (1996)”—instead of citing references in the text through foot¬ 
notes or endnotes. Of course, for scholarly notes that elaborate on the 
text, it is still appropriate to use footnotes or endnotes. 

An excellent guide to the MIA and Chicago styles is Kate Turabian’s A 
Manual for Writers of Term Papers, Theses and Dissertations, Sixth Edition 
(1996). Ms. Turabian was the graduate secretary at the University of 
Chicago; she was responsible for accepting all theses and dissertations sub¬ 
mitted at that institution and as such was the acknowledged expert on the 
Chicago Manual of Style. Thus, her book is essentially a condensed version 
of the Chicago Manual. Two chapters that are particularly useful are Chap¬ 
ter 11, which shows a comparison of the two citation styles, and Chapter 
14, which illustrates the two styles in a large number of sample notes and 
references. Note that this edition of Turabian is based on the Fourteenth 
Edition of the Chicago Manual of Style, rather than the most recent Fif¬ 
teenth Edition. The major changes in the new edition have to do with on¬ 
line resources, which were not covered in the Fourteenth Edition, so 
Turabian is still useful. Hopefully, a new edition of Turabian will be forth¬ 
coming. The following paragraphs introduce the three styles and refer you 
to various guides to each. 

MLA is the acronym for the Modern Language Association and is the 
style favored by scholars in the humanities disciplines. This style is spelled 
out in Chapter 8 (on footnotes) and Chapter 9 (on bibliographies) of 
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Turabian (1996). 10 Additional guides to the MIA style are available from 
tire University of Wisconsin Writing Center (http://www.wisc.edu/ 
writing/Handbook/DocMLAhtml) and the Writer’s Workshop at tire 
University of Illinois (http://www.english.uiuc.edu/cws/wworkshop/ 
MLA/bibliographymla.htm). 

APA is tire acronym for tire American Psychological Association and is 
the style used by scholars in psychology and other social sciences, including 
some economists. Useful APA guides are available from tire On-line Writ¬ 
ing Lab at Purdue University (http://owl.English.purdue.edu/handouts/ 
research/r_apa.html), the University of Wisconsin Writing Center 
(http://www.wisc.edu/wiiting/Handbook/DocAPA.html), and the Writer’s 
Workshop at the University of Illinois (http://www.english.uiuc.edu/cws/ 
wworkshop/bibliography_style_handbookapa.htm). 

The Chicago style is the choice of the natural sciences and many social 
sciences, including economics. The Chicago Style is formally called the 
author-date system in tire Chicago Manual of Style, Fifteenth Edition (2003). 
In this style, references are listed at the end of a work in the “Reference List” 
(not tire bibliography). This style is explained in Chapter 10 of Turabian 
(1996). Air additional guide to tire Chicago style is available from the Long 
Island University library: http://www.liu.edu/cwis/cwp/library/workshop/ 
citchi.htm. 

Given that most literature surveys are now conducted online, you 
should be aware that online citation styles may not be adequately spelled 
out in the guides just mentioned. A good source of information about 
citation styles for online documents is Andrew Harnock and Eugene 
Kleppinger’s, Online: A Reference Guide to Using Internet Sources, available at 
http://www.bedfordstmartuis.com/online. 

Note that ajournal article (or book) found online is still a journal arti¬ 
cle (or book) and should be cited accordingly. That is, online citation 
styles should be reserved for documents that have no print analog. For ex¬ 
ample, an article in the American Economic Review that you obtain online 
from JSTOR should be cited as ajournal article rather than as an online 
document. 




The important thing in science is not so 
much to obtain new facts as to discover 
new ways of thinking about them." 

SIR WILLIAM BRAGG 




If n Chapter 1 we observed that scholars create knowledge by construct- 

| ing competing arguments. They construct these arguments by using 
Elslsfour tools: 

Mental processes—thinking about an argument; 

Oral discourse—verbalizing the argument; 

Mathematics—manipulating equations to derive meaning; and 
Writing. 

Though arguments can be developed using all of those tools, in this 
chapter we focus on writing as the principal one. McCloskey (2000) 
makes this point early in her book Economical Writing, when she asserts 
that the practice of economics depends more on writing than on math¬ 
ematics or statistics or any of the other technical skills normally associ¬ 
ated with economics! 

Economists use writing for two purposes, the second of which you may 
not have considered: 

Writing as a Product, a form of communication to disseminate re¬ 
search results, and 

Writing as a Process for deriving the research results. 

This chapter will focus on the second purpose, while the first will be the 
subject of Chapter 5. 
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The Structure of an Argument 


Writing to Learn 

Writing is more than a product; it is also a process or a tool for creat¬ 
ing knowledge. Students sometimes say, “I know it, but I just can’t explain 
it.” But if you can’t explain an economic idea, you only know it on a su¬ 
perficial level. Whiting forces you to think concretely, to figure out exacdy 
what you mean. By contrast, when using purely mental processes, it is easy 
to be vague with your thoughts, to leave pieces out. When you write, those 
holes in the logic become readily apparent. In this context, writing is a 
tool of discovery, a way of working through ideas that you don’t fully un¬ 
derstand. 1 In other words, writing is a positive-sum game. When you write, 
you don’t merely put down what you already know; rather, you end up 
knowing more. 

Composition As a Creative Process 

. Let’s explore this idea. The process of writing is called composition. What 
does that mean? Composition includes both analysis (taking something 
apart to understand it) and synthesis (putting pieces together to make a 
whole). Consider how you as a student “put together” a traditional term pa¬ 
per based on notes written on index cards. You have to sort through the in¬ 
formation, classify it, and decide what is and is not relevant. You arrange and 
rearrange so as to discover a structure, an organization that best displays tire 
information on the index cards until a meaning reveals itself. In other 
words, composition involves searching for relationships among the facts and 
ideas that make up the raw material of your research. Wlren you compose, 
you choose and arrange those facts and ideas; you try to put them in the 
most meaningful order. It is the same when a director makes a film. The film 
is never completed in only one “take.” Rather, different takes are filmed, 
edited, and combined in differentways until just the rightstoryis developed. 

Students often end this composition process too early, before it fully re¬ 
veals its insights. Bean (1996, 7) notes, “A key observation among teachers 
of critical thinking is that students . . . tend to reach closure too quickly. 
They do not suspend judgment, question assumptions, imagine alterna¬ 
tive answers, play with data, enter into the spirit of opposing views, and just 
plain linger over questions.” This problem is understandable if one views 
writing only as a product. Bean (1996,15) characterizes this mistaken view 
with a metaphor: “Writing is like the box and wrapping paper into which 
we put our already formulated ideas.” 

In fact, composition is like a fine wine, which needs time to mature and de¬ 
velop into its full flavor. This explains why it is important to start an assignment 
early, write a draft, and then take time off from your writing to allow your sub¬ 



conscious to mull over what you’ve come up with. Bean calls this “incubation.” 
As journeyman writers, you need to make a special effort to maintain an open 
mind about your topic and not allow yourself to reach a conclusion too soon. 

In this light, you should view revising a paper as far more than editorial 
correction. It means re-viewing the information, re-thinking the way it is 
organized, re-constructing the argument, looking for new patterns of 
meaning. It means asking, “Do I have the best interpretation of the facts, 
or is there another that would do a better job?” This is hard work. You must 
take an idea, and do tire best you can to build an argument for it. Then, 
once you’ve made a commitment to it, you need to step back and honestly 
scrutinize what you’ve written. This approach to revision is one reason why 
instructors emphasize writing multiple drafts of a paper: so you consider 
and explore different ways of thinking about tire topic. We will discuss 
multiple paper drafts in more detail in Chapter 5. 

One implication of this creative view of writing is that almost certainly 
you will have to “throw away” some of what you write, even though this is 
difficult to do. Even experienced writers have a sense that if they throw 
away parts of their writing, especially parts they have labored over, they are 
throwing away something of themselves. This difficulty may come in part 
from the misconception that term papers must include all the facts you 
discovered (and wrote on index cards) in the preliminary stages of the re¬ 
search. This is simply not so! A well-written paper is structured to create an 
argument. Recall the nature of a theory or a model. A model is a simpli¬ 
fied version of reality that provides insight into the question being re¬ 
searched by focusing on the most important aspects of the question, while 
leaving out the less important aspects. In this respect, an argument is like 
a model; it provides a structure or story to best explain the facts, but prob¬ 
ably doesn’t explain or address them all. This is why “throwing away” por¬ 
tions of your writing that upon review don’t contribute significantly to the 
argument is a natural part of tire composition process. 

The Structure of an Argument 

The purpose of scholarly writing is to make an argument that is per¬ 
suasive to experts in the field. Scholars use the composing process, de¬ 
scribed in the previous section, to discover and refine their arguments. 
When completed, scholarly writing follows a logical, hierarchical struc¬ 
ture, in which the thesis is supported by a series of nested arguments that 
lead logically to the thesis as a conclusion. This is illustrated in a stylized 
way in Figure 4.1, which shows the pyramid structure of a well-developed 
student argument. You should note that actual arguments will be less me¬ 
chanically structured and more individual in character. 
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NOTES FOR NOVICE RESEARCHERS 

Composing with the Outline Function 
of Your Word Processor 

The "Outline" function of your word processing software can be a 
very useful tool in the process of composing and revising your paper. 
For example, in Microsoft Word, the outline function allows you to 
convert your paper draft into an outline of the major points you made 
in the paper. The text for each section can be collapsed under each 
heading (or expanded to show the complete text). Once the paper is in 
that form, it becomes very easy to reorganize the paper without having 
to cut and paste. Alternatively, the outline function can allow you to 
create an outline of your paper that consists of the major points you 
plan to make, and then flesh out those points to write the complete 
paper. To use the outline function in Word, open your document, then 
select "View" from the pulldown menu on the toolbar, and then select 
"Outline.” Though the outline function can take a short while to 
master, it is well worth the effort. Indeed, I used Outline to organize 
and write this chapter. 


The thesis or primary assertion is at the top, the major points are un¬ 
derneath the thesis, and supporting explanation or evidence is under each 
major point. Nothing is added at random. Any points (or stray facts you 
discovered in tire preliminary research) that do not lead to the thesis ei¬ 
ther directly or indirectly are omitted from the argument. 

Let’s examine this structure in more detail by looking at die nature of 
an argument. In Chapter 1, we defined an argument as an assertion sup¬ 
ported by reasoning or evidence. An assertion is a claim or point of view. 
For example, McCloskey’s statement mentioned at the beginning of this 
chapter is an assertion. In casual conversation, people often argue on tire 
basis of assertions alone (what Missimer [1995] calls loose arguments ). 
Similarly, in professional or scholarly writing authors might refer to Mc¬ 
Closkey’s “argument” that writing is more important to the work of econ¬ 
omists than technical skills. But argument in the strictest sense refers to 
botii tire assertion and die supporting reasoning.' 



The Structure of an Argument 


Figure 4.1 Pyramid Structure of an Argument 


Major Reasons 
Less Caregiver 
Concern and 
Attention 


Supporting Points 

Low paid caregivers will lack the emotional 
investment in the child’s well-being that is 
associated with high levels of cognitive 
development 

Even highly motivated caregivers won't be 
able to devote large amounts of time and 
attention, given the large number of children 
they care for 


PRIMARY ASSERTION 

Children's Cognitive . 
Development Is Harmed 
When "y- ;: T' , 
Mothers Work 


Less Interaction 


Less time spent involving intellectual 
stimulation (e.g., reading to/with a child) 

Less opportunity for one-on-one 
interaction with the child 


!',j 


Lack of Stability 
in the Child's Life 


Turnover in employment at daycare centers 
means a child may repeatedly lose adult to 
whom he/she has become attached 


Child's movement between daycare centers 
may have similar consequences 


What Does It Mean to Say That a Conclusion 
Follows from the Evidence? 

For an argument to be persuasive the reasons supporting it must be true, 
and the conclusion must follow from those reasons. What does it mean to 
say that a conclusion follows from die evidence? An inference is a conclusion 
reached after reasoning logically about facts and relationships. If we define 
an argument as a claim supported by evidence, then an inference is like an 
argument turned backward. In other words, given certain facts or assump¬ 
tions, given certain relationships between the facts, if we can reason our way 
to the inference that is, if the evidence leads us logically to the inference as 
a conclusion, then we say that the conclusion “follows.” If the conclusion fol¬ 
lows, then we have what Missimer calls a “warranted inference. 
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Consider, for example, the argument that consumer spending in¬ 
creased last year because of an increase in the unemployment rate. To as¬ 
sess this argument, one would ask two questions. First, is the reason true?; 
did unemployment increase last year? Suppose tire answer is yes. The sec¬ 
ond question then is: Does tire conclusion follow from the premise? In 
other words, is it reasonable or logical to conclude that an increase in un¬ 
employment would cause individuals to spend more? Probably not! An in¬ 
crease in unemployment would reduce incomes, at least for the 
unemployed. Lower incomes should translate into less spending rather 
than more. In this case, we say that die conclusion doesn’t follow from die 
premise, and the argument fails to convince. By contrast, if the argument 
stated that consumer spending increased because of a decrease in unem¬ 
ployment, then die conclusion would follow, and assuming unemploy¬ 
ment had fallen, die inference would be warranted. 

A logical fallacy is an argument that is flawed because die conclusion 
does not actually follow from the reasons stated, even though die argu¬ 
ment is phrased in a way that makes you think it does. Logical fallacies of¬ 
ten find dieir way into public arguments. Appendix 4A lists the most 
common logical fallacies, which you should learn to identify and avoid. 

Examining an Argument 

Let’s examine die argument made by Robert Samuelson (2002), which 
is reproduced in the following box. 


A War We Can Afford 

By Robert J Samuelson 

Wednesday, September 18, 2002; Page A29 | 

A possible war with Iraq raises many unknowns, but ) 
“can we afford it?” is not one of them. People inevitably ask 
that question, forgetting that the United States has be¬ 
come so wealthy it can wage war almost with pocket 
change. A war with Iraq would probably cost less than 1 
percent of national income (gross domestic product). 
Americans have grown accustomed to fighting with little 
economic upset and sacrifice. 


Samuelson's Major 
Claim: A possible 
war with Iraq 
raises many un¬ 
knowns, but"can 
we afford it?" is 
not one of them. 





Examining an Argument 



The last time die United States truly mobilized for con¬ 
flict was World War II. Roughly 16 million Americans served 
in the military; that was two-diirds of all men from 18 to 34, 
reports historian James Patterson of Brown University. The 
costs were stupendous. In 1944, federal spending totaled 
44 percent of GDP, with military spending at 38 percent of 
GDP. At home, Americans needed ration coupons to buy 
meat, gasoline, and otiier staples. 

Ever since, two tilings have transformed die economics 
of war: The U.S. economy has gotten bigger, and wars have 
gotten smaller. Measured by what it produces—and ad¬ 
justed for inflation—the economy is more than five times as 
large as it was in 1945. Meanwhile, America’s wars have be¬ 
come more localized, draining less of the nation’s wealth. 

In the Korean War, die defense budget reached 14 per¬ 
cent of GDP in 1953, but much of that spending went for a 
massive buildup offerees in Europe. “For every tank that went 
to Korea, two went to Europe,” says Historian Alan Gropman 
of die Industrial College of die Armed Forces. “The B-52s we 
built had nothing to do with the Korean War.” hi August 
1949, the Soviet Union unexpectedly exploded its first atomic 
bomb, prompting President Harry Truman to order the Na¬ 
tional Security Council to undertake a major review of U.S. 
strategy. The resulting document (called NSC 68) envisioned 
huge deployments of U.S. troops to Europe to counter a con¬ 
ventional Soviet attack, which—once the Soviets had their 
own nuclear weapons—seemed more credible. 

Aldiough the Cold War’s costs remained large, defense 
spending during die Vietnam War went only as high as 
9.4 percent of GDP, in 1968. Even so, Lyndon Johnson’s early 
attempt to finance the war without any tax increase—to have 
both guns and butter—helped raise inflation. After Vietnam, 
defense spending (again, as a share of GDP) drifted down 
and dropped sharply once the Cold War ended. 

It now runs about $350 billion annually. That’s a lot of 
money, but, in an economy producing more than $10 trillion 
annually, it isn’t much of a burden. It’s slightiy more than 
3 percent of GDP. How much a war widi Iraq would cost is 
guesswork. It would vastiy exceed the total in Afghanistan, 


Major Piece of the 
Evidence: We can 
measure the finan¬ 
cial burden of a 
war on a nation by 
the cost of the war 
as a share of gross 
domestic product 
(GDP). During 
World War II, the 
cost of the war 
reached nearly 
40 percent of GDP. 


Crux of the Argu¬ 
ment: Since World 
War II, wars have 
become more lo¬ 
calized and less 
costly, while 
the economy 
has grown 
substantially. 


Supporting Point: 
Discussion of the 
Korean War (more 
localized and less 
costly than World 
War II) 


Supporting Point: 
Discussion of the 
Vietnam War 
(more localized 
and less costly) 
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Three Types of Reasoning 




which the Congressional Budget Office estimated at $10 bil¬ 
lion for fiscal 2002. The Persian Gulf War cost $61 billion. 
Even if a new war cost $100 billion, it would be only about 1 
! percent of GDP. 

5 Clearly affordable. Whether we should afford it is another 

question. So is the effect of a war on the economy, which 
; could go either way. Extra spending might help. A swift vic- 
i tory might bolster confidence. Or a war might jeopardize oil 
-j supplies from Saudi Arabia and elsewhere, through terror- 
1 ism or political upheaval. Protracted fighting might hurt 
j confidence, consumer spending and the stock market. 

’j “The economy is growing, but barely,” says Mark Zandi 

of Economy.com. “It wouldn’t take too much for the re¬ 
covery to roll back into recession—or close to it.” 

Oil prices are the biggest vulnerability. In 1990, after 
Saddam Hussein invaded Kuwait, prices doubled, from 
: about $18 a barrel in May to $36 in October. That helped 

tip a weak U.S. economy into recession—though prices 
quickly receded in 1991. Barring a calamity, many econo¬ 
mists minimize the odds of a repetition. 

I If Iraq’s oil exports “are lost, it wouldn’t be a big factor,” 

says John Lichtblau of the Petroleum Industry Research 
v Foundation. Iraq is now exporting about 700,000 barrels a 
’ day out of total world demand of almost 77 million barrels 
daily. The Saudis could offset any shortfall, he says. More- 
:i over, the U.S. economy has become less energy-intensive 
and is less sensitive to higher prices. The economy is grow- 
; ing at an underlying rate of 2 percent to 2.5 percent, says 
economist Nariman Behravesh of tire forecasting firm DRI- 
j WEFA. War fears have already pushed oil prices to about 
$30 a barrel, but even if they rose to $40 for a year, GDP 
:j growth might drop only half a percentage point, he says. 

Because wars surprise, who knows? But the important 
; questions are harder. Is this war justifiable? Should the 
United States go it alone? What will happen if we don’t 
j fight? What will happen if we do? By contrast, economic is- 
sues are a sideshow—and should stay so. If this war is nec- 
j essary, we can afford it. 

Copyright © 2002, Newsweek. Reprinted with permission. 



To summarize Samuelson’s argument: 

Assuming that we can measure tire Financial burden of a war on a 
nation by its cost as a share of GDP, 
then since World War II, U.S. GDP has increased dramatically. 

At tire same time, wars have become more localized and thus less costly; 
therefore, we can conclude that the United States is better able to af¬ 
ford a war with Iraq than we were able to afford World War II. 

Samuelson’s conclusion does follow from the evidence. 


Three Types of Reasoning: Deductive, 
inductive, and Warrant-Based 

Every argument consists of assertions and evidence. Or, looking at it 
backward, ever)' argument consists of premises (or assumptions) and 
conclusions (or inferences). Let’s examine more carefully the ways in 
which scholars reason. There are three different approaches to rea¬ 
soning: deduction, induction, and warrant-based. An argument can in¬ 
clude all three types, but to keep things simple let’s look at them one 
at a time. 

Deductive reasoning starts from one or more general principles and 
derives specific predictions from them. These predictions are what 
Sherlock Holmes called “deductions.” The principles need not be all 
that weighty; you can think of them as premises or assumptions. They 
are tire “givens” in the exercise. A valid deduction is one in which the 
conclusion must follow from the premises. In other words, if the premises 
from which the reasoning begins are true, then the conclusion must be 
true also. In a sense, a deduction is a logical proof. Consider a simple 
example: Start with the premise that all monetarists believe in some ver¬ 
sion of the quantity theory of money. If Milton Friedman is a monetarist, 
then he must believe in the quantity theory of money. The deduction is 
that Friedman believes in the quantity theory of money. Samuelson’s ar¬ 
gument in the article we examined previously was another example of 
deductive reasoning. 

Note that, strictly speaking, a deduction is considered valid as long as 
the conclusion follows from the premises, even if the premises aren’t true. To 
be persuasive, what we really want is a sound argument, that is, a valid de¬ 
duction in which all the premises are true. 

When scholars “theorize,” they are typically using deductive reasoning. 
For example, consider the origins of the law of demand. Assume that in¬ 
dividuals derive satisfaction or utility from consuming goods and services, 
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but that their consumption is limited by each individual’s budget. So the 
sum of each individual's expenditures on all goods purchased must not ex¬ 
ceed the individual's income. Assume further that the utility individuals 
derive from consumption follows certain general principles: that increased 
consumption of any good or service yields more utility, but at a decreasing 
rate. Thus, consumers are subject to tire law of diminishing marginal util¬ 
ity. Finally, assume that individuals wish to spend their income on the 
combination of goods and services that maximizes their (total) utility. It 
can be shown by manipulating these assumptions that if the price of a 
good or service rises, tire amount of that good or sendee that individuals 
will purchase should fall. This is, of course, the law of demand. The 
process of theorizing will be addressed more fully in Chapter 7. 

An inductive argument is one that reasons in the opposite direction 
from deduction. Given some specific cases or situations, what can be in¬ 
ferred about the underlying general rule? The reasoning process follows the 
same steps as in deduction: one reasons from one or more premises to a 
conclusion. The difference is in whether the premises or the conclusion is 
the general principle or the specific case. An inductive argument is not a 
proof in the sense that a deduction is. Rather, induction is a probabilistic 
inference. If the truth of tire premises increases the likelihood that the 
conclusion is true, dren we have an inductive inference. It is still possible 
for the rule to be false, however, despite the cases that lead one to believe 
that it is true. 

When scholars use statistical evidence as proof of a hypothesis, they are 
using inductive logic. Suppose a researcher believes that the price of the 
product affects tire amount the firm chooses to produce. The researcher 
collects a sample of data, runs appropriate statistical tests, and discovers 
that based on the data sample, an increase in the price of the product cor¬ 
responds to an increase in the quantity supplied by the firm. We call this 
general rule the law of supply, which in this example was validated by a spe¬ 
cific data set. In short, the specific data resulted in the general rule. That 
is induction. The use of empirical methods to test hypotheses will be ex¬ 
plained in detail in Chapters 10 and 11. 

Missimer (1995) suggests drat we can learn more by focusing on die 
logic of arguments than by labeling the reasoning as inductive or deduc¬ 
tive. This suggestion is especially true, since complex arguments can in¬ 
clude both types of reasoning, as well as the third type: warrant-based. 

Warrant-based reasoning is often used in scholarly writing. Warrants are 
unstated or underlying assumptions on which an argument stands. Often 
warrants are higher-order assumptions or axioms that are not testable. For 



example, in the previous example that deduced the law of demand, we 
assumed that consumers maximize utility. This is a generally accepted as¬ 
sumption in economics, but if you tiiink about it, how would one test that? 
How can we be sure that consumers try to maximize their utility rather 
than settling for, say, 90 percent of tiieir potential utility? (How can we be 
sure that students try to maximize tiieir GPA, rather than pursuing some 
other objective in school?) In fact, we can’t be sure. 

The purpose of a warrant is to establish the relevance of the evidence 
m supporting some claim. Consider the following claim: 

An increase in the excise tax on beer would decrease beer consumption 
among underage drinkers. 

What is die connection between a tax and beer drinking? What links the 
reason to the assertion? What makes die reason acceptable as evidence? 
The warrant here is die law of demand, something every economist is fa¬ 
miliar with: increasing the tax would effectively raise die price of beer 
thereby decreasing the quantity of beer consumed. 

When scholars mite to an audience of their peers they tend not to ex- 
plicidy identify tiieir warrants. This is because a research community is de¬ 
fined m part by the warrants it shares. As a result it is unnecessary for 
members to state them outright. Additionally, tiiis practice allows scholars 
to write their arguments in a compact form. As a novice researcher until 
you become familiar with the warrants in a research community you 
should explicitiy state all the warrants in your argument. 

Complex arguments, such as those found in scholarly articles and 
books, can include all three types of reasoning. For example, the tiieoret- 
lcal analysis may combine elements of deductive and warrant-based rea¬ 
soning, while the empirical analysis will typically be inductive reasoning. 

What Makes for a Persuasive Argument? 

So, where does that leave us? We noted earlier that for an argument to 
be persuasive, the reasons supporting it must be tine, and die conclusion 
must follow from the reasons. The evidence of the strongest arguments 
however, has a number of additional complementary characteristics. 3 In 
particular, the evidence should be accurate, autiioritative, precise, clearly 
explained, complete, and representative. Let’s turn to each of these char¬ 
acteristics. 

If the supporting reasons must be true, then factual evidence needs to be 
accurate. If your argument depends on unemployment hating risen, tiien 
you need to be sure diat it did. If you don’t check to confirm that it did, 
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some reader will and may find that it didn’t. Similarly, when you analyze 
data, a key part of empirical research, you need to confirm that your data 
are correct. Something as tririal as a data-entry error can completely 
change die results of a statistical analysis. 

Evidence should also be authoritative. If you are using data, is it from a rep¬ 
utable source? A student of mine once investigated die demand for Beanie 
Babies. She concluded that die demand for Beanie Babies as a collectible 
should depend on dieir expected future price as well as the current price, 
die buyer’s current income, and odier items one might expect to see in a 
demand function. She obtained data for expected future prices, which her 
statistical analysis revealed to be die most important determinant of de¬ 
mand for Beanie Babies. Unfortunately, she obtained the data from die 
manufacturer, which was not an objective source, since it stood to benefit 
significandy if its products were seen to be a good investment. 

Similarly, if you support an assertion witii a quotation, is die quotation 
from an autiioritative source? If you cite an argument, is it from a re¬ 
spected publication? In Chapter 3, we discussed the way in which scholarly 
publications are vetted, making diem authoritative. The Internet provides 
wonderful examples of the pitfalls of using evidence that lacks authority. 
Since virtually anyone can publish virtually anything on the Internet, one 
needs to be extremely cautious about basing conclusions on evidence dis¬ 
covered on the Internet. Bootii et al. (1995, 102) declare that if you are 
not an expert in the field, “do not trust any source as authoritative until 
you know die research in the area.” 

Additionally, evidence needs to be precise. Arguing that welfare reform is a 
bad policy because it harmed many people is weaker evidence than arguing 
that 3.7 million people lost benefits. Imprecision or ounight vagueness 
will weaken the reader’s perception of evidence, just as mistakes or care¬ 
lessness do. Imagine trying to hit a baseball witii a Nerf bat versus a 
wooden one. The Nerf bat will diffuse die energy of your swing, so it trill 
be hard to hit die ball with any force. In the same way, a vague reason lacks 
power. (Note, however, that tiiere is such a tiling as too much precision, 
for example, reporting regression coefficients to eleven decimal points.) 

Evidence is rarely self-evident. One nearly always needs to explain it. 
This is especially time of statistical results. Undergraduate researchers 
sometimes believe tiiat statistical results must merely be presented to be 
convincing, but as we will explain in Chapter 12, statistics must be inter¬ 
preted for the reader. If die evidence is not clearly explained, readers may 
not see how die conclusion follows from die reasoning. If a reason is not 
clear to die reader, it will not help persuade him or her that die argument 
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is true. In fact, it may do the opposite. Bootii et al. (1995) emphasize tiiat 
it is often helpful to ask the question: Will die reader see die evidence as 
you see it? Is there additional information that you have diat establishes 
the relevance of die evidence? If so, you may need to provide it. 

To construct die strongest possible case, the evidence needs to be complete, 
that is, it should have deptii and breadth. You can have a sound, logical ar¬ 
gument but still not persuade if your argument fails to consider all die rele¬ 
vant evidence (pro and con) or doesn’t address it in enough detail. 

Deep reasoning requires you to think deeply about causes and effects. 
Why might unemployment have risen? A student might respond, “Per¬ 
haps because businesses are producing less output, so they require fewer 
workers.” There is nothing wrong with that reasoning, except that it 
doesn’t go far enough. If that was the extent of an answer to an exam 
question, it might be labeled a “shallow argument,” one diat lacks depth. 
A better answer would go on to ask, why are businesses producing less 
output? The answer could be “Because sales are off.” Why then have sales 
fallen? Perhaps it is due to the decline in die stock market, which makes 
consumers feel less wealtiiy and businesses feel pessimistic about the fu¬ 
ture. When you reason deeply, you follow the train of cause and effect as 
far as you can. 

In a similar way, broad reasoning requires you to think widely about 
causes and effects. Instead of being satisfied with just one or two reasons 
for an assertion, you should try to identify all possible reasons. Booth et al. 
(1995) suggest that you try to anticipate all the questions a thoughtful 
reader might have about your argument, and then address those questions 
in your argument. 

The evidence presented in your argument should also be representative 
of thought on an assertion. For example, students sometimes think they 
should only report the evidence that supports their assertion. Nothing 
could be further from the trutii! Aside from being intellectually dishonest, 
tins opens your argument to die correct criticism that you haven’t consid¬ 
ered the opposing viewpoint. 4 Failure to do so will lead knowledgeable 
readers to label you ignorant at best, and fraudulent at worst. You are not 
trying to win a debate, but ratiier determine the best answer to the ques¬ 
tion. Moreover, as Booth et al. (1995) point out, one can actually improve 
the strengdi of one’s argument by admitting its limitations and conceding 
contrary evidence. In a statistical context, the same issue arises: one’s sam¬ 
ple of data should be representative of die underlying population if infer¬ 
ences based on the sample are to be valid. This important point will be 
discussed in detail later in Chapter 10. 
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Example: Evaluating an Argument 
with Unrepresentative Evidence 

Suppose die average GPA of students who graduate from college is a B. 
Does that imply that grade inflation is occurring? After all, if there were no 
grade inflation, shouldn’t the average grade be a C? There are at least two 
reasons why this may not be a warranted inference. The first involves tire 
GPA requirement for graduation. Most schools require seniors to have at 
least a C average to graduate. Thus, even if tire average GPA of the senior 
class is a C, tire GPA of graduates must be higher, since those seniors with a 
lower GPA are by definition excluded from graduation. In other words, 
die sample of data used in this argument (i.e., seniors who graduate) is not 
representative of the population of students. 

The second reason the conclusion does not follow in this argument in¬ 
volves the courses students take over their undergraduate careers. Most 
students probably end up taking courses in and perhaps majoring in sub¬ 
jects in which they do well, or at least better than average. This implies that 
almost certainly the GPA of the “average” graduate will be higher than a 
C. Again, the sample of data used is probably not representative. A better 
test might be to examine the grades students that earn in their general ed¬ 
ucation courses, which every student must take, and especially those taken 
in their first or second year of college. 

Example: Evaluating an Argument 
Using Broad and Deep Reasoning 

Plow can one explain tire increased purchases of sport utility vehicles by 
Americans during the late 1990s? One answer might be the increase in 
personal incomes over that period. But is that tire only possible reason? Af¬ 
ter all, real personal disposable income rose only 28 percent while sales of 
SUVs roughly tripled. The law of demand suggests an inverse relation be¬ 
tween price and quantity demanded. If tire price of SUVs fell, that could 
also explain the trend. But in fact SUV prices rose, so that reason isn’t 
valid. What other reasons might be suggested by demand theory? Could it 
be a decrease in tire price of gasoline, a complementary good? What hap¬ 
pened to tire price of minivans, a substitute good? When you reason 
broadly, you consider as many reasons as you can. Inexperienced critical 
drinkers need to make a special effort to cast their net widely before draw¬ 
ing conclusions to ensure they don’t miss one or more relevant reasons. 

How broadly and how deeply must one think? One rule of thumb is to 
consider tire weightiness of tire primary assertion one is arguing. A con¬ 
troversial thesis with large implications will require broader and deeper 
explanations than a more commonplace assertion. Consider tire recent 
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argument that the information age has fundamentally changed tire work¬ 
ings of tire U.S. economy with tire result drat U.S. productivity growth has 
significantly increased. This assertion, if true, will have major implications 
for the standard of living in tire United States, as well as for macroeco¬ 
nomic policy. Though a great deal of evidence supports this thesis, tire 
weight of its implications means drat for most analysts, it remains an open 
questkm. 

Example: Constructing a Simple Argument 

Consider tire following assertion: “At the next meeting of the Federal 
Open Market Committee, the Federal Reserve will almost certainly decide 
to lower interest rates again.” 

What might be some reasons in support of this assertion? What might 
lead the Fed to pursue a more expansionary monetary policy? 

8 The unemployment rate is the highest it has been in four years. 

8 Consumer confidence is falling. 

8 Business investment spending is down. 

0 Inflation is negligible. 

What might be some underlying reasons that support these larger rea¬ 
sons? For example, what might explain tire high unemployment rate or 
the falling consumer confidence; what is tire implication of low inflation? 

8 U.S. production has slowed substantially in the wake of tire stock mar¬ 
ket decline over the past year. 

8 Households are worried about the future. 

8 Given the decline in U.S. production, the capacity utilization rate 
is down. 

8 The lack of inflation allows tire Fed to pursue a more expansionary 
policy than it might otherwise be comfortable with. 

Notice how the principal assertion and the major and minor reasons 
form the pyramid structure of the argument, as illustrated in Figure 4.1. 
Assuming that tire assertion follows from the reasons aird that tire reasons 
are valid, then the more complete tire reasoning, tire more solid trill be tire 
base of the pyramid and the more solid tire argument. 

Note also that when you begin to construct an argument, you may not 
know which way you will end up arguing. Rather, you should start with tire 
research question. Next, you consider all possible answers pro and con. 
Next, you consider all the reasons for tire answers. Finally, you evaluate tire 
evidence and make a judgment. This judgment becomes the assertion, 
which is supported by your reasoning-—that is, your explanation of how you 
evaluated the evidence and why you came to tire decision you did. 


Exercises 


chapter 4 • using writing ms a 1001 tor economic nescaicn 


An Important Caveat 

A conclusion can follow from the evidence, the evidence can be correct, 
and the argument may still be incorrect. In other words, internal consis¬ 
tency in an argument is necessary but not sufficient if another conclusion 
explains the evidence better and more correctly. In the real world, schol¬ 
ars rarely have all of the relevant evidence. Rather, they construct their ar¬ 
guments based on a sample of the evidence that they think is the most 
relevant. It is always possible that they will miss some critical piece of evi¬ 
dence (an assumption, a piece of data, a relationship between parts of the 
evidence). As a result, their argument will be incorrect. 

The way to minimize this possibility is to insure that your argument has 
tire features outlined in the last section, including broad and deep think¬ 
ing. But you can never be certain. Therefore, a modicum of humility is al¬ 
ways in order when presenting a scholarly argument, even a good one. 

SUMMARY 

• Writing is a tool for creating arguments. 

• When you develop an argument you need to think deeply and broadly 
about your reasoning. 

• Each conclusion should follow from its premises. 

• Can any assertions be made more convincing with more elaboration? 

0 Are there other reasons to support the conclusion? 

• Is an alternative explanation more plausible? 

MOTES 

1. Knoblach and Brannon (1983). Zinnser (1988), one of the founders of 
the Writing to Learn concept, observes, ‘We write to find out what we 
know.” McCloskey (2000) similarly states, “Writing is thinking.” 

2. Booth et al. (1995) define an argument as including an assertion (or 
claim), evidence to support the claim, a warrant (or general principle 
that explains why you think your evidence is relevant), and qualifica¬ 
tions (or limits on tire assertion). The last three elements are what we 
referred to as “supporting reasoning” in our definition of “argument.” 

3. This section draws heavily from Booth et al. (1995). 

4. This is called the fallacy of special pleading. See Appendix 4A. 

5. Robert Weissman, “Why we protest,” Washington Post, September 10, 
2001, p. A21. 



SUGGESTIONS FOR FURTHER READING 

Booth et al. (1995) —Classic text on college-level research. Chapters 7-10 ex¬ 
plain in detail how to construct scholarly arguments, but at a level that 
is very accessible to undergraduates. 

Graff (2003) —Thoughtful critique of academia as needlessly obscuring 
what education is: being able to understand die arguments of otirers as 
well as to construct one’s own. See, especially, Chapters 4 and 5. 

Epstein and Kernberger (2005) —High-level but very readable guide to the 
logic of economic analysis. Examples are taken from classic economic 
literature and current texts. 

McCloskey (2000) —The original guide to writing in economics; first edition 
published in 1987. Very good, but also very opinionated. Not every 
point McCloskey makes about composition would be considered law by 
writing instructors. 

Missimer (1995) —Excellent introduction to critical thinking and how to 
build written arguments. 

Trelogan (2001) —Useful introduction to logic for undergraduates. 

EXERCISES 

1. Consider the following excerpt from tire Washington Post? 

The IMF and World Bank must cancel the debts owed them by im¬ 
poverished countries. Poor countries, including those that have 
passed through the institutions' debt relief program, routinely spend 
more money servicing foreign debt than they do on health care or ed¬ 
ucation. U.N. Secretary General Kofi Annan has called on rich coun¬ 
tries to provide $10 billion a year to assist poor countries ravaged by 
AIDS, tuberculosis, and malaria. By what logic should the people of 
these same countries be forced to transfer money to rich countries? 

The IMF and World Bank have sufficient funds and assets in their cof¬ 
fers to undertake debt cancellation without additional money from 
U.S. taxpayers. 

What is the principal assertion here? What are tire reasons given? What 
are the underlying assumptions? Are the reasons valid? Does tire con¬ 
clusion follow from the reasons? Do you find tire argument persuasive? 
Wiry or why not? 

2. Construct an argument to support tire following assertion: The next 
quarter’s GDP growth figures will show drat the United States has begun 
an economic recover)'. Identify several major reasons in support of tiris 





A logical fallacy is an argument that is flawed because die conclusion 
does not actually follow from the reasons stated, even though die argu¬ 
ment is phrased in a way drat makes you think the conclusion follows and 
even though die reasoning has appeal. Logical fallacies often fmd dieir 
way into public arguments since they can be persuasive to many people. 
Here is a list of some commonly used logical fallacies: 

• Straw Man—Mischaracterizing a position by omitting its strongest rea¬ 
soning, often by caricaturing it in a way that no one would agree with. 
Thus, one is not addressing the actual argument posed. For example, 
“Alan Greenspan is opposed to budget deficits because he is concerned 
that the U.S. government might go bankrupt.” 

(Greenspan has consistently opposed budget deficits on the grounds that deficits 
increase real interest rates, which diminish capital investment and ultimately 
reduce per capita GDP. The extent to which this occurs is an empirical question 
about which reasonable economists disagree.) 

• Special Pleading—Selectively using the available evidence; only using ev¬ 
idence that supports your position; ignoring any evidence in opposition. 
For example, “We should run a budget deficit since it will stimulate the 
economy.” 

(Yes, it will in the short run, but it may also adversely affect the economy 
in the long run.) 

• Begging the Question—Malting an assertion in which the reason given 
doesn’t really support the conclusion; hence, a non sequitur. Often this 
appears as a type of circular argument, where die “reason” is only a re¬ 
statement of the assertion. For example, “Consumers buy less at a higher 
price because of the law of demand!” 

(The law of demand can be paraphrased as an inverse relation between price and 
the amount consumers buy. Thus, the statement doesn’t explain why consumers 
buy less.) 

• Affir ming the Consequent—-Drawing conclusions based on unexamined 
premises. For example, “Government fine-tuning of aggregate demand 
would result in a more stable macro economy.” 

( Theoretically, yes, but this assumes that government is capable of fine-tuning — 
an assumption that many macro economists question.) 
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® Ad Hominem—Refuting an argument by attacking die person, rather 
than his or her argument; rejecting an assertion because of who is mak¬ 
ing it rather than because of the evidence. For example, “His argument 
about die budget deficit must be wrong because he’s a liberal”; “Her ar¬ 
gument about tax cuts must be wrong because she’s a conservative.” 

{An argument should be evaluated on the basis of its logic. Note that this is some¬ 
what the opposite of an appeal to authority!) 

0 Appeal to Authority—Accepting an argument because an expert en¬ 
dorses it. (Think about this fallacy the next time your instructor makes 
an assertion!) The “expert” may have no particular expertise on this is¬ 
sue. Alternatively, this may be a question on which different experts dis¬ 
agree. The argument may be correct, but it is a fallacy to accept it without 
examining the logic and evidence. For example, “Budget deficits should 
not be a concern since President Bush is not worried about them.” 

{Bush may not be worried about them, but he is apolitical leader rather than an 
economist! Some economists have a different view of deficits.) 

• Appeal to the People/Appeal to the Many—Accepting (or rejecting) a 
position because many others do, again without examining the argu¬ 
ment. For example, textbook sellers often tell faculty, ‘You should use 
this book in your class because many other schools do.” 

{My course is not the same as the courses at all those other schools; my students 
are not like the students at all those other schools.) 

0 Post Hoc, Ergo Propter Hoc (literally, “After this, therefore because of 
this”)—What comes before was the cause. Though it is intuitive to think 
that what comes before causes what comes after (e.g., a college degree 
leads to a good career), it is not necessarily true. In fact, the cause may 
come after the effect. For example, does the buildup of toy inventories in 
the first three quarters of every year “cause” Christmas? This is a varia¬ 
tion of the important dictum that correlation is not necessarily causa¬ 
tion. For example, in a cleverly titled study invoking this fallacy, James 
Tobin asked “Does tire Fed cause Christmas?” 

{Even though the money supply tends to grow in the fourth quarter of every year, 
this is in anticipation of Christmas rather than a cause of it.) 

9 Fallacy of Composition/Fallacy of Division—What’s true at the micro 
level must be true at the macro level, and vice versa. For example, the 
classical economists argued that the solution to the widespread unem¬ 
ployment of tiie Great Depression was to decrease real wages. . 
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{Keynes pointed out that if all workers face lower incomes, they would reduce their 
consumption, further lowering the demand for goods and thus for labor.) 

0 Appeal to Pity—Using sympathy for one issue as justification for another 
issue. For example, “My research paper deserves a good grade because 
I’ve been taking eighteen credit hours this semester.” 

{It is impressive that you’ve been taking so many courses this term, but the grade 
you get on your research paper)- should be based on the quality of the paper.) 

8 False Analogy—Drawing parallels between two cases where there are 
enough substantive differences to question the comparison. For exam¬ 
ple, “The way to balance the government budget in Argentina is to raise 
tiie income tax rate.” 

(Though such a policy could work in a developed country where people pay their 
taxes, in developing countries like Argentina there is widespread tax avoidance, 
so merely raising the rate may not work. Indeed, it might have the opposite effect 
because of a Laffer Curve effect.) 

A comprehensive discussion of logical fallacies can be found in Stephen 
Downes, Stephen’s Guide to the Logical Fallacies, http://www.datanation.com/ 
fallacies. 





"Planning to write is not writing. 
Outlining. . . researching. . . talking to 
people about what you're doing, none of 
that is writing. Writing is writing." 

E. L. DOCTOROW 

I 

n Chapter 4, we explained how the writing process could be used as 
H a tool for developing one’s argument. In this chapter we discuss the 
§jf use of writing as a product of the intellectual process. 

S irs Writing as a product is different in several ways from wridng as a 
process. Writing as a process is a tool tire author uses to deteimine what he 
or she thinks about a question. As such, issues like format, style, and gram¬ 
mar are largely irrelevant, since you are writing to yourself. Writing as a 
product is the report of that inquiry. Thus, the audience is someone other 
than the author. We observed earlier that scholarly writing embodies an ar¬ 
gument that attempts to persuade experts in tire field. Thus, the writing 
needs to be more explicit and formal. This means drat everything needs to 
be spelled out clearly and in sufficient detail to get your point across. Ad¬ 
ditionally, you must follow proper standards of punctuation and giammar. 

What Is Economic Writing? 

Students often tell me drat they don t know how to write an economics 
paper, or that they don’t know what makes a paper economic in character. 
Palmini (1996) identifies at least four different types of writing done by 
economists: Research Studies (including senior dieses, dissertations, joui- 
nal articles, monographs, and scholarly books); Social Criticism (including 



op-ed pieces and books for educated lay audiences); Policy Analyses (in¬ 
cluding technical reviews of legislative or regulatory proposals and other 
government reports); and Business Analyses (forecasts, market analyses, 
cost analyses, and similar analyses). 

Though many features of writing are common across all disciplines, cer¬ 
tain aspects are discipline-specific. Petr (1998, 229-230) points out that 
“Good academic writing . . . reifies] on a unique blend of vocabulary, con¬ 
cept, method, precedent and history, typifying drat discipline and reflect¬ 
ing its thought-patterns (its metaphors, if you will).” Good disciplinary 
wiiting requires more than the ability to cut and paste phrases together 
from die vocabulary of die field. What is required is a higher order of cog¬ 
nition: understanding the appropriate disciplinary context for those 
phrases. Bean (1996) observes diat each discipline asks certain types of 
questions and has its own way of analyzing them. Each discipline uses its 
own types of arguments, proofs, and empirical evidence. 1 

What fundamentally distinguishes economic from other disciplinary 
writing and what all types of economic writing share is the use of economic 
analysis. We introduced this concept in Chapter 2 when we discussed what 
makes for a research question in economics. In otiier words, all economic 
writing applies economic theory to derive insights about and explain an¬ 
swers to a question or problem. The types of theoretical arguments and em¬ 
pirical evidence economists use will be explored in Chapters 7,10, and 11. 


Writing Steps 

Writing experts have identified four steps necessary to complete a fin¬ 
ished paper: 

1. Pre-writing or Exploration 

2. Writing the First Draft 

3. Revising 

4. Editing 

The first step was the subject of the last chapter. The last three will be 
tire subject of this one. Though we describe this process as four steps, writ¬ 
ing, like research more generally, isn’t usually a linear process. Instead, it’s 
iterative; many writers loop through the steps more than once. Thus, you 
can expect to read about many of tire same issues in the later section on 
“Rerising” that you do in tire following section, “Writing the First Draft.” 

Writing the First Draft 

One of the hardest steps in completing a paper is writing the first draft. 
Even professional writers find this to be the case. How do you start this 




Writing the First Draft 
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Allocating Your Time Between the Writing Steps 

Inexperienced writers spend most of their time and effort on 
drafting, relatively less on pre-writing, and relatively little on revising 
and editing. Experienced writers reverse those proportions. They 
spend the most time on pre-writing and revising and the least time on 
drafting. You would be well served to follow the practice of 
experienced writers. 


step? You can prepare for the first draft by wiiting throughout the research 
process: taking notes as you research your topic, wiiting critiques, drafting 
your own ideas (Booth et al., 1995, 149). 

The first draft really begins in the pre-writing or exploration stage. It is 
at this stage that the writer identifies and develops tire argument to be 
made in the paper. This is done, as we described in Chapter 4, by review¬ 
ing and attempting to organize your research materials. 

Before you start writing, there is one key question you need to address: 
who is die audience for your paper? We noted at tire beginning of that 
chapter that one significant difference between wiiting as a process and 
wiiting as a product is the audience. We also noted that this is not a trivial 
question since a research paper really has two audiences: tire scholarly 
community for your topic and the instructor and classmates who are likely 
to read your paper. It is a good idea to ask your instructor who your audi¬ 
ence should be. I encourage my students to write to economics majors who 
are at tire same level as they are. In any case, this question is not one you 
should ignore. As McCloskey (2000) observes, if you fail to identify your 
audience, your paper will almost certainly miss the target. 

Features of Good Economic Writing 

Once you have your argument sketched out, you are ready to write. 
What are you aiming for? What does a well-written paper look like? 

Good economic writing has several features. These are illustrated in or¬ 
der of importance in Figure 5.1. First, good wiiting should be focused, not 
fuzzy. Early in your paper, you should make clear to the reader exactly what 
the paper is about. You can do this by inserting a thesis sentence that de¬ 
scribes the principal assertion of the paper. Experts differ on what con¬ 
struction this sentence should take. Wyrick (1994) suggests, “The purpose 


Figure s.i Features of Good Economic Writing 

1. Good writing should be focused, not fuzzy; 

2. Good writing should be organized; 

3. Good writing should be solidly developed) 

4. Good writing should be dear, concise, and precise) 

5. Good writing should be free of grammatical errors. 


of this paper is . . . McCloskey (2000) considers this poor style. Perhaps 
it is, but my advice to writers is to first communicate clearly; then worry 
about style. If you need to use. Wyrick’s construction to get your point 
across, that’s fine. As you become a more experienced writer, you’ll be able 
to get beyond this. 

Figure 5.2 contains passages from two undergraduate papers. Read 
each one and see if you can tell which one is more focused. 

What is tire primary difference between the two passages? Can you iden¬ 
tify die diesis sentence for die passage on the left? Can you do it for the 
passage on the right? 

Most readers conclude diat the passage on the right is more focused, 
primarily because of the clear thesis sentence at the end. 

The second feature of good economic writing is that it should be orga¬ 
nized. This is far more than a question of format (e.g., introduction, body, 
conclusion). Rather, scholarly papers should follow a logical, hierarchical 
structure in which the thesis is supported by a series of nested arguments 
that lead logically to the thesis as a conclusion. This was the subject of 
Chapter 4. Recall the pyramid structure of a well-developed argument il¬ 
lustrated in Figure 4.1? The structure of an argument may be a pyramid, 
but a written document is by its nature a linear structure. We can see this 
in Figure 5.3, which converts the argument from Figure 4.1 into the linear 
organization of a paper. 

To determine tire organization for your paper, you should ask several 
questions: What are the major points that you want to make that lead log¬ 
ically to your conclusion (i.e., that prove your thesis)? These points will be 
tire principal pieces of evidence to support your argument. What is the 
best order for those points in the paper? Are there any weak or missing 
links in tire logic? If so, fix them. Are there any superfluous points (i.e., 
points that don’t lead to the conclusion)? If so, omit them! 

These guidelines for an argument are true regardless of its length. In a 
short essay of one hundred words, the thesis is likely to be either at tire be- 
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[ Figure 54 ! Examples of F ocus 

Growing concerns over the increas¬ 
ing juvenile crime rate have sparked 
extensive research into the socioe¬ 
conomic and psychological factors 
that lead young adults to engage in 
illegal behavior. Over the past forty 
years, the number of juvenile court 
cases handled has more than dou¬ 
bled. The rise in destructive acts 
among youths is evident in school 
dropout rates, teenage pregnancies, 
drug abuse, and quite apparently, in 
the substantial increase in teenage 
crimes committed. Over the past 
several years, attempts have been 
made to link this growing propen¬ 
sity toward crime with the break¬ 
down of the American family. 
According to a 1988 survey by the 
National Center for Health Statis¬ 
tics, children in single-parent fami¬ 
lies, many of whom are products of 
divorce, are two to three times as 
likely to have emotional or behav¬ 
ioral problems as children in two- 
parent families. This paper is an 
attempt to solidify the correlation 
between divorce and juvenile crime. 


ginning or end, with the remaining sentences devoted to presenting the 
reasons. In an essay of several pages, each paragraph would have a thesis 
sentence, with the remainder of the paragraph devoted to explaining and 
supporting that thesis. Each paragraph’s thesis sentence would represent 
a point in support of the paper’s overall thesis. 

In an essay that has several sections or in a book with multiple chapters 
the organizational structure will follow this same hierarchical pattern. 

The third feature of good economic writing is that it should be solidly 
developed. Once you have determined your major points, you need to ex¬ 
plain each of those points in detail and support them with evidence. In 


In the postcolonial period, several 
new African nation-states experi¬ 
mented with economic develop¬ 
ment ideologies. Tanzania began to 
ostensibly establish regional hege¬ 
mony in the postcolonial period, un¬ 
der the auspices of the socialist 
world, immediately after indepen¬ 
dence in 1963. The Arusha Declara¬ 
tion of 1967 formalized the state's 
ideological orientation toward "Uja- 
maa," the collective (state-owned 
and centrally controlled) ownership 
of state-granted monopolies 
(Bagachwa, 1990). This declaration, 
however politically and rhetorically 
laudable, did little to increase the 
productive capacity of the country. 
With a quickly decaying infrastruc¬ 
ture, nationalistic development poli¬ 
cies, high levels of regional political 
and investment risk, numerous in¬ 
vestment complications and capital 
flight, Tanzania's socialist legacy has 
left a microcosmic wake of typologi¬ 
cal economic development prob¬ 
lems for the world's less developed 
countries (LDCs) (Bagachwa, 1990). 


Figure 5.3 Structure of a Written Paper 


Supporting Points 
Low paid caregivers 
will lack the emotional 
investment in the child's 
well-being that is 
associated with high levels 
of cognitive development 

Less time spent involving 
intellectual stimulation i . 
(e.g., reading to/with a 
child) , 

Turnover in employment at 
daycare centers means a j 
child may repeatedly lose ■ 
adult to whom he/she has 
become attached 


THESIS 

Children's Cognitive Development 
Is Harmed When Mothers Work 


MAJOR POINTS 

Less Caregiver Concern and Attention 
Less Interaction 

Lack of Stability in the Child's Life 


CONCLUSION 

Children's Cognitive Development 
Is Harmed When Mothers Work 


Supporting Points 
Even highly motivated 
caregivers won't be able 
to devote large amounts 
of time and attention, 
given the large number 
of children they care for 


Less opportunity for one- 
on-one interaction with 
the child 


Child's movement 
between daycare centers 
may have similar 
consequences 


other words, the same hierarchical structure that organizes tire overall pa¬ 
per should also be present at the micro level. Each section of a paper 
should have its major point stated in a single sentence. The remainder of 
the section should explain the reasoning for that major point. This is il¬ 
lustrated in Figure 5.3, where each major point is supported by several mi¬ 
nor points. 

The remaining two features of good economic writing will be explained 
in the later sections of this chapter on revising and editing your paper, 
since they are not critical for writing the first draft. 

Getting the Ideas Down on Paper 

When you start writing your draft, don’t worry' about grammar or me¬ 
chanics; don’t worry about getting the details right or fixing mistakes. In 
fact, concerning yourself with those is a good way to get bogged down. 
Rather, focus on just getting your basic ideas down on paper. Indeed, it is 
best to do this in one sitting (or for a longer paper, one complete section 
at a time). McCloskey (2000) suggests that if you can’t draft tire paper in a 
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single setting, it is helpful to write yourself notes about what you have done 
and where you are going next, before you end your work session. 

The idea is to prevent yourself from running into “writer’s block.” All 
writers experience this at some point, but if you can put this off until the 
revising stage you will be ahead of the game. (See Figure 5.4 for some prac¬ 
tical suggestions for getting over writer’s block.) 

Many writers find it useful to create an outline of what they are trying 
to say. This is not a formal topic-based outline with roman numerals like 
you were taught in high school. Rather, Booth et al. (1995) argue that 
point-based outlines, which are organized as a series of points or asser¬ 
tions, tend to be more useful since they illustrate your argument. Note also 

that this step overlaps somewhat with the earlier one about developing 
your argument. 6 

Sometimes it makes sense to develop the outline after you’ve written an 
initial draft of your ideas. For example, I usually find it easier to brainstorm 
about the first draft, writing down my ideas, and only then creating an out¬ 
line to organize them. Writing is a personal process. There is no one right 
way to do it! We noted in Chapter 4 that the outline function of your word 

piocessmg software is a helpful way to proceed regardless of the outline 
format you choose. 

Booth et al. (1995) identify several poor schemes for organizing a pa¬ 
per, which you should avoid. The first is repeating the assignment: pro¬ 
viding exactly and only what is asked for in tire assignment, and in the 
order it was asked for. The second is merely summarizing your sources. Re¬ 
member, a good paper should analyze rather than just report what others 
ave said on a topic. The third is explaining the steps in the process that 
you went through to write it. A research paper should explain what you 
found, not what you did. 

Giving Credit for Intellectual Property 

All writers know to avoid plagiarism. What may not be clear is exactly 
what plagiarism is. Plagiarism is taking credit for someone else’s words or 
ldeas, even when it’s unintentional. It is a form of academic dishonesty 
There are two types of plagiarism. The first is using someone else’s 
words as if they were your own. That is, it means directly repeating words 
from another’s work without including quotation marks. This occurs if 
someone copies directly from another’s work or if he or she submit as their 
own a complete paper they obtained from the Internet or from some other 
supplier. Plagiarism can also occur if you don’t keep careful records as you 
put together your research materials and mistake a quotation as your own 
paraphrase. This is plagiarism even if you reference the original source. 
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A Caution About Quotations 

You should use quotations cautiously, and only if you cannot 
possibly paraphrase without losing either meaning or impact. 
Excessive use of quotations suggests to the reader that you do not 
understand the source well enough to paraphrase and so you must rely 
on the words of the author. As McCloskey (2000, 45) states, "No college 
[or higher-level] paper can be fashioned by stringing together passages 
from other writers." In short, use quotations like spices, in moderation, 
to enhance but not overwhelm the flavor of your dish. 


The second type of plagiarism is using someone’s unique idea without 
attribution. By unique I mean somedring that is not common knowledge 
in the discipline. The key question here is what is known in the discipline. 
Even if sometlring was unknown to you before your research, if it was 
known by experts in the field, it is considered common knowledge. You do 
not need to cite it to avoid plagiarism. A rule of thumb is that an idea that 
occurs in the publications of three different authors is considered com¬ 
mon knowledge. Note, however, that in a literature survey you may wish to 
cite important studies, even if they are widely known, since showing your 
knowledge of important works in an area establishes your credibility. 
Booth et al. (1995, 167) state: ‘You also plagiarize when you use words so 
close to those in your source, that if you placed your work next to the 
source, you would see that you could not have written what you did with¬ 
out the source at your elbow.” This is a strict standard, but Booth et al. pro¬ 
vide some helpful suggestions for how to practice it. 

For example, when you use someone else’s work, say so early on in 
your discussion, rather than in a footnote at the end of the section. It is 
a common practice to place a footnote at the end of tire first sentence of 
a section, using wording such as “This section draws heavily from Booth 
et al. (1995).” 

The key to avoiding plagiarism is keeping careful track of what your 
sources say and what you’ve interpreted from those sources, and giving 
credit where credit is due. When you are going to cite an idea in a schol¬ 
arly work, you should reference it briefly in the form of a note, then in 
more detail at the end of the work in the reference list or bibliography. Ap¬ 
pendix 3A outlined the major citation styles used in academia. You should 
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find out fi om your instructor what style he or she wishes you to use, and 
then follow that carefully. 

Revising the Paper 

No one, not even a Nobel laureate, can write a perfect paper on the first 
try. Every first draft can be improved by revision. In fact, Booth et al. (1995 
171) observe that 

Perhaps the biggest difference between experienced writers and 
beginners is their attitude towards th[e] first draft. The experienced 
writer takes it as a challenge: / have the sketch, now comes the hard \ 
but gratifying work of discovering what i can make of it. The begin¬ 
ner takes it as a triumph: Done! I'll change that word, fix this comma, 
run the spell-checker, and <Print>l A first draft is indeed a victory! 
but resist that easy way out. 

We commonly make a distinction between a first draft and the final 
diaft. In practice, good writers go through more than two versions of their 
papers, usually many more. Each major draft may be the result of several 
revisions or minor drafts. For example, consider the first rough draft you 
write versus the more polished “first draft” you’re willing to have a friend 
read. This is analogous to the way software releases are named: WordPer¬ 
fect 5 versus WordPerfect 6 are considered major revisions, while Word¬ 
Perfect 6.1 versus WordPerfect 6.2 are minor revisions. In the same way, it 
makes sense to differentiate between major drafts of your paper, on the 
one hand, and revisions or versions of a major draft, on the other. 

As students of economics, we should understand that the marginal ben¬ 
efit of revision is positive, but diminishing. The second major draft is usu¬ 
ally much better than the first. The third major draft is even better than 
the second, though the improvement may not be as great. If you only write 
one draft, you forgo those substantial improvements. One of the most dis¬ 
appointing comments I write on students’ final papers is “This would make 
an excellent first draft.” In other words, it has great potential but that po¬ 
tential was not realized. Of course, to attain this potential you need to start 
drafting far enough before your deadline to have the time for multiple re¬ 
visions. You know this! But here is something you may not have thought se¬ 
riously about: Your instructor gives you a month to complete a writing 
assignment, not because she wants you to wait three weeks to start, but be¬ 
cause a month is how long it takes to do the assignment well, by writing 
multiple drafts and revisions. Would you intentionally show up late to an 
examination and so leave half the questions unanswered because you ran 
out of time? Of course not, but that is effectively what you do when you 
wait until the last minute to begin writing the paper. Think about it. 
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Is the Thesis Clear? 

The purpose of revision is to craft your paper so it better embodies the fea¬ 
tures of good writing listed in Figure 5.1, in order of importance. Start with fo¬ 
cus. If tire paper has no focus, then the remaining issues are irrelevant. Is the 
thesis or principal assertion of the paper clear? Can you underline the thesis 
sentence in the paper’s introduction? Knot, the most important thing you can 
do to improve dre paper is fix that. Can you underline the corresponding sen¬ 
tence in the paper’s conclusion? If not, the reader may find that your paper 
lacks closure, that it did not do what it set out to. Booth et al. (1995) point out 
that these two thesis sentences should match each other, or at least not con¬ 
tradict one another. If the first is framed in terms of a question, then the sec¬ 
ond should answer the question. If they do not match or if the second does 
not answer tire question posed in the first, they need to be reworded. For ex¬ 
ample, the thesis expressed in a paper’s introduction could be: 

This study will attempt to determine the major factors that influence the 

retail demand for diamond jewelry. 

The corresponding sentence in the paper’s conclusion could be: 

This paper has found that the demand for diamond jewelry is highly sen¬ 
sitive to the price, but highly insensitive to income. 

Note that it is not unusual for authors to have to make this kind of cor¬ 
rection, since when they drafted the introduction they may not have 
known exactly how the paper would end up. But it’s important to make 
sure the introductory and concluding thesis statements match by the time 
the paper is completed. 

Is the Paper Well Organized? 

The next step is to review your paper’s organization, which is another 
way to think about the paper’s argument. Whether you are revising a rela¬ 
tively short paper, no more than a few pages in length, or a longer paper, 
the principles are the same. One way to check your paper’s organization is 
to create a paragraph outline. A paragraph should be the explication of a 
single thought or idea or theme. Each paragraph should have a thesis sen¬ 
tence, typically the first sentence. If a given paragraph discusses two or 
more themes, you should split it into two or more paragraphs. Identify the 
thesis sentence of each paragraph, and copy each one into a new docu¬ 
ment in the order in which they occur. Now read the sentences and check 
to see if they make sense in that order and lead logically to the thesis as a 
conclusion. If they do not, then re-organize the order of the sentences un¬ 
til they do. If any sentence does not seem to contribute to the argument, 
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omit it. When your paragraph outline makes sense, then reorganize your 
paper to match tire paragraph outline. 

Let’s try this with an example. Suppose we develop tire following para¬ 
graph outline from a draft of a paper: 

1. HIV/AIDS infection reduces people’s ability to work when they be¬ 
come ill. 

2. The deaths caused by HIV/AIDS reduce a nation’s human capital, 
since not only is the labor force reduced, but the investments in 
those workers’ human capital are also lost. 

3. Immigration has been increasing into many developing nations that 
have high rates of HIV/AIDS. 

4. A nation’s labor supply depends on its birthrate, immigration rate, 
labor force participation rate, health expenditures, life expectancy, 
and investments in education. 

5. The rate of HIV/AIDS infections tends to reduce labor supply. 

6. The HIV/AIDS illness requires that resources be used to treat HIV/ 
AIDS patients that could otherwise be used for disease prevention. 

The principal assertion of this paper is sentence 5. How might we reor¬ 
ganize the sentences to support this thesis? Though there maybe more than 
one way to do this, tire following is one organizational scheme that works: 

1. The rate of HIV/AIDS infections tends to reduce labor supply. 
[Thesis sentence] 

2. A nation’s labor supply depends on its birthrate, immigration rate, 
labor force participation rate, health expenditures, life expectancy, 
and investments in education. 

[Since labor supply is the key issue, what affects labor supply?] 

3. HIV/AIDS infection reduces people’s ability to work when they be¬ 
come ill. 

[Reducing labor force participation] 

4. The illness requires that resources be used to treat HIV/AIDS pa¬ 
tients that could otherwise be used for disease prevention. 

[Reducing the labor force participation of workers who get other 
diseases] 

5. The deaths caused by HIV/AIDS reduce a nation’s human capital 
since not only is the labor force reduced, but the investments in 
those workers’ human capital are also lost. 

6. [Therefore,] the rate of HIV/AIDS infections tends to reduce labor 
supply. 

[Restatement of the diesis as a conclusion] 





Note that sentence 3 in the original outline did not lead to the diesis 
and so was omitted. 

Are Your Points Supported with Evidence? 

The third step is to examine the development of each major point in 
your paper Each point is itself an assertion that needs to be supported 
with evidence. If the first sentence in a paragraph spells out the main 
point of tiiat paragraph, the remaining sentences should flesh out and 
support that main point. Does the main point need to be explained in 
more detail? Can you provide examples of what die main point says? What 
makes you think diat the main point is valid? What evidence can you pro¬ 
vide to bring the reader to that conclusion? Any sentences that do not 
contribute to this task should be omitted or moved to another paragraph 
where they are appropriate. 

Take the second sentence in the revised paragraph outline as an example: 

A nation's labor supply depends on its birthrate, immigration rate, labor 
force participation rate, health expenditures, life expectancy, and invest¬ 
ments in education. 

This could be fleshed out as follows: 

Labor supply depends on population, which is a function of the birthrate, 
the immigration rate, and the death rate. The labor supply also de¬ 
pends on the percentage of the population that is working. This labor 
force participation rate depends on cultural factors, as well as expen¬ 
ditures on health, which help people avoid illness. Health expendi¬ 
tures also affect life expectancy, which influences the death rate. 
Though these factors determine the number of workers, the effective 
labor supply depends additionally on the amount of education that 
workers receive. 

We heed to address one last topic before we move on to discuss issues of 
style. Writer’s block is a problem that afflicts all writers at some point. Figure 5.4 
lists some practical tips for overcoming it, once you have a first draft to revise. 

Writing Style 

Once your paper has a well-defined focus and organizational structure, 
you can address the remaining features of good writing: clarity, concise¬ 
ness, and precision. This is where you polish your argument to make it as 
persuasive as possible. A good wilting style enables your reader to easily 
comprehend your writing without having to work at it. 



Figure 5.4 Practical Tips for Overcoming Writer's Block 

1. Copy the section of your paper where you find yourself stuck onto a 
separate standalone document. Edit that document, in any way that 
makes sense, even by radically revising it. The fact that it's a different 
document from the source will free you to consider radical revisions. 
Revise the paper until it's completely to your satisfaction. Then paste it 
back into the source paper, replacing the original section. 

2. Print your paper (double-spaced), and review it. Changing the format 
from computer screen to paper helps to make ideas for improvements 
leap out at you. 

3. Put the paper aside for a few days or a week. Let your subconscious 
work on it. Almost always, when you come back to it you will be able 
to proceed effectively. 

4. Construct a paragraph outline for the paper. Summarize each para¬ 
graph using a thesis sentence. Revise the list of thesis sentences as 
necessary , either by adding to them or splitting them into separate 
ideas. Review your revised list of thesis sentences for order, redun¬ 
dancy, missing links, and dead ends. Use what you've learned to re¬ 
vise the original. 


Strive for Clarity 

The most important feature of good writing style is clarity. Imagine lis¬ 
tening to a radio station that you can’t quite tune in. As McCuen et al. 
(1993, 2) note, ‘You listen intensively for a short while, but you keep miss¬ 
ing words; you can’t quite understand [what the announcer is trying to 
say].” Unclear writing is exactly like that. 

To write clearly, you should follow a few simple rules. A sentence con¬ 
sists of a subject and a verb. Compose each sentence so the subject is the 
main actor of tire story, and the verb is the main action. In addition, when¬ 
ever possible choose strong verbs over weak ones. You might think this 
would be obvious, but for some it is not. Here is an example from a re¬ 
search paper: 

"Noticeably different is the negative coefficient of CPI." 

Note how much clearer the sentence reads when the subject, Noticeably 
different, and the object, negative coefficient of the CPI, are reversed: 

"The negative coefficient of CPI is noticeably different." 


This improvement results because the main actor in the sentence is the 
“negative coefficient,” but originally it was shunted to die end of the sen¬ 
tence. This sentence also employs a weak verb: is. Norice writers tend to 
overuse to be, which generally only lengthens a sentence without adding 
anything. Verbs like to be tend to convey less information than stronger 
verbs. Suppose we replaced is with the stronger verb differs: 

"The negative coefficient of CPI differs noticeably." 

Subject: “negative coefficient” 

Verb: “differs” 

How?: “noticeably” 

Most readers would find this version much clearer than the original. 

Scholarly writing has a reputation for being difficult to understand. 
Though it is true that scholarly writing makes use of specialized vocabulary 
and specialized forms of argument and evidence, it should not and need 
not follow die turgid academic style we see too often in scholarly works. 
This style has many labels, none of them complimentary. McCloskey 
(2000) describes it as the author posing as “The Scientist” or “The 
Scholar.” Lanham (1992) calls it “the Official Style.” Harvey (2000) de¬ 
scribes this style as employing “obfuscation, nominalization, passive voice, 
[and] long, wordy passages which muddy up die question of who did what.” 
Considei', for example, this passage from a research paper: 

"Even though the numbers show sustained growth the opinions of many 

scholars are still conflicting." 

What does this mean? I don’t know. 

Researchers write this way for several reasons. First, they think they are 
supposed to. They think it sounds more formal or scientific than plainer 
prose. They think it makes them appear more knowledgeable about their 
subject. This is especially true of novice researchers. 

Second, they may be trying to express complex ideas that they do not fully 
understand. McCloskey (2000) labels this the “This-Stuff-Is-So-Complex- 
I-Can’t-Make-It-Clear” approach to writing. The solution to dais problem is 
straightforward, though not necessarily easy. You must drink through the 
ideas until you understand them well enough to explain them clearly. You 
may need to do more research to see how other authors have explained 
these ideas. You may also need to discuss the ideas with colleagues or your 
instructor. In short, tire solution is to put more work into tire project. 

Third, as Harvey (2000) points out, researchers may be trying to avoid 
taking responsibility for what they write. Writing is risky, as we noted in 
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Chapter 4. Writers worry that what they say might be wrong. If they write 
in a vague, incomprehensible style using words and phrases that they 
have seen in scholarly writing (e.g., rational expectations, first-order condi¬ 
tions, statistical significance), then the reader won’t be able to criticize 
what he or she doesn’t understand. This is like the student who, unsure 
how to spell a word, hedges his or her bets by spelling it several ways in 
an essay. In fact, the only thing that is certain is that the student will spell 
the word wrong. Writing without clarity is going to be criticized not for 
what it says, but because it says nothing. Indeed, Harvey (2000) observes 
that “Without clarity, you’re not really communicating, just going 
through the motions.” 

Do not waste your efforts trying to write this way. If you focus on clarity, 
conciseness, and precision, your style will take care of itself. 

Use the Active Voice 

Academic writers often use the passive voice. Booth et al. (1995) suggest 
that they do this because they think it makes their writing sound more ob¬ 
jective. The passive voice occurs when writers make the subject the recipi¬ 
ent rather than the performer of the action (Harvey, 2002). Usually, this is 
done by replacing an active verb with a form of the verb to be. Consider the 
following examples: 

“Next, the data were analyzed.” 

“It is expected that price will increase with disposable personal 
income." 

“It is argued that...” 

The solution is to rephrase the sentences using an active voite, where 
the subject performs the action: 

“I analyzed the data.” 

“We expect that price will increase with disposable personal income.” 

“We argue that.. .” 

Notice that the active voice is more precise in meaning. With the pas¬ 
sive voice, the reader needs to figure out who is analyzing the data, who is 
expecting the price to increase, whois arguing. To write more precisely you 
must refine your thinking. For example, why do you expect that prices will 
increase? A likely reason is because economic theory predicts it. If so, you 
should explicitly say that, something our author did not. Consider the 
third example: who or what is arguing that. . . ? Is it the researcher? Is it 
the results of the analysis? Again, by thinking about subjects and verbs as 
you refine your thinking, you also refine your writing. 
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Using the First Person 

Researchers are often reluctant to use the first person in their 
writing. Frequently, the result is that they use the passive voice. In 
fact, there is nothing wrong with using the first person, especially 
when you are describing your interpretations and conclusions. If you 
can make your writing clearer by using the first person, then do so. 


Passive voice is not incorrect per se, but by disguising the subject of the 
sentence, it makes the reader work harder to figure out what exacdy you 
are saying. For this reason, until you become a skilled writer you should 
probably avoid the passive voice. 

Describe Action with a Verb 

Academic writers also tend to nominalize, that is, turn verbs into nouns. 
Again, their motivation seems to be to make tire writing sound more schol¬ 
arly. In fact, it merely makes it sound more pretentious and harder to read. 
Here are some examples: 

“There was a failure of the results to confirm the hypothesis.” 

“The growdr of an economy is somewhat fluctuated due to the amount 
of technological increase.” 

“Next, the data were subjected to analysis.” 

Nominalization tends to add words to a sentence without adding mean¬ 
ing. Thus, the sentence is harder rather than easier to read. Additionally, 
like fire passive voice, nominalization obscures the subject of the sentence. 
The solution to nominalization is to describe the action with a verb. 
“The results failed to confirm the hypothesis.” 

“The growth of an economy fluctuates due to the amount of techno¬ 
logical increase.” 

“Next I analyzed the data.” 

In sum, to write clearly choose strong verbs to describe the actions in 
your sentences, and make the main actors the subjects of your sentences. 

Be Precise and Concise 

Most of our discussion of style has focused on clarity, which is its most im¬ 
portant feature of style. Now, let’s discuss precision and conciseness. Dis- 
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cussing complex issues often requires nuance. Thoughtful writers understand 
that word choice matters and that synonyms have slightly different meanings. 
If you want readers to understand you, choose tire word that means exactly 
what you wish to say. Precision, like clarity, suggests to tire reader that you 
know what you are talking about. Vagueness conveys the opposite impression. 
Consider tire following sentence: 

"Some areas that enforce affirmative action are helping those who are at 

a disadvantage to overcome discrimination." 

Think about this sentence. It begins with a vague subject: Some areas. 
One cannot easily tell whether this refers to areas of law, areas of culture, 
or geographic areas. The verb in the sentence (are helping ) is weak. The 
next phrase ( those who are at a disadvantage) doesn’t really add any infor¬ 
mation for the reader. Compare the original with this revised version: 

"The states that enforce affirmative action help individuals overcome dis¬ 
crimination." 

Another characteristic of good writing style is conciseness. Try to make 
every point as concisely as you can. Avoid empty words that don’t add in¬ 
formation for tire reader. Never say more than you need to. 

Occam’s razor is a well-known corollary to tire scientific method. It pro¬ 
poses that when choosing among alternative theories, one should select 
the theory that explains the phenomenon being studied with the least 
complexity. The same rule is true for writing. Less is more as long as the 
explanation is complete. 

In the real world, a supervisor will never ask you to write a report of at 
least twenty pages. Time is a scarce resource and people prefer brevity, so 
long as they still get the point. The same is true of academic readers. 

Don’t worry about the length of your paper. Never add “filler.” If you go 
through tiie writing process described here, you will have enough to say to 
satisfy your assignment. Of course, this assumes that you have thought 
through your topic carefully and completely during the drafting and revi¬ 
sion steps. If you haven’t, filler won’t help. 

If you want your writing to be persuasive you should avoid emotion¬ 
laden phrases. Good writing is hard work. If you don’t take your writing se¬ 
riously, why should the reader? 

Writing Mechanics 

The last step in completing a paper is editing. Now is the time to wony about 
correcting grammar, mechanics, as well as spelling and typographical errors. 
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Use Complete Sentences 

Serious writers employ complete sentences. A complete sentence im¬ 
plies a complete thought, while a sentence fragment implies fragmented 
or incomplete drought. As we noted previously, a complete sentence con¬ 
sists of a subject and a verb. A sentence fragment is a subject without a verb 
or a verb without a subject. Few 7 of us would write “An economist” or “Tested 
the hypothesis” by themselves. Yet sentence fr agments are common among 
novice writers. 

Often novice writers will use a phrase as a sentence, when grammatically 
it is not. Take the following, for example: 

"The key question for determining whether you have statistically signifi¬ 
cant results." 

In this phrase, The key question is a noun, and for determining sounds like 
a verb, but grammatically it isn’t. Compare the phrase with this sentence: 

"The key question for determining whether you have statistically signifi¬ 
cant results is does your estimated t-statistic exceed your critical t." 

You can see that the original phrase, even though it has a noun and a 
verblike word, serves only as die subject of the full sentence. Since the orig¬ 
inal phrase includes a subject but no verb, it is not a complete sentence. 

Alternatively, a phrase may include botir a noun and a verb, but drey are 
subordinated by another word: “ Although the estimated coefficients were neg¬ 
ative.” In this phrase, the estimated coefficients is a norm, and were negative is a 
verb. The phrase the estimated coefficients were negative is therefore a complete 
sentence, but by preceding them with Although you create a dependent 
clause, which is not a complete sentence because it needs additional wording 
to complete its meaning. Compare the original with the following sentence: 

Although the estimated coefficients were negative, they were not statis¬ 
tically different from zero. 

You should be able to see that tire dependent clause was an incomplete 
thought, and thus an incomplete sentence. 

The last type of sentence fragment occurs when a phrase has both a 
noun and a verb, but the verb isn’t properly conjugated for the noun: “A 
software package generating statistical results.” A software package serves as 
a noun, and generating serves as a verb, but the entire phrase isn’t a com¬ 
plete sentence. To determine whether you have a complete sentence ask 
yourself whether the phrase standing alone is a complete thought. If so, 
you have a complete sentence. If not, you need to fix it. 
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Don't Let Sentences Run On 

A run-on sentence is two or more independent clauses that are not sep¬ 
arated with the proper punctuation. In a run-on sentence several com¬ 
plete thoughts are jammed together, making it difficult for tire reader to 
determine where one ends and the next begins: 

"An increase in the price of seed caused an increase in the price of 
corn an increase in the price of corn caused a decrease in the quantity 
demanded." 

A run-on can be fixed in several ways. The two clauses could be con¬ 
nected with a semicolon: 

"An increase in the price of seed caused an increase in the price of 
corn; an increase in the price of corn caused a decrease in the quan¬ 
tity demanded." 

They could also be connected by a conjunction such as and: 

"An increase in the price of seed caused an increase in the price of corn, 
and an increase in the price of corn caused a decrease in the quantity 
demanded." 

Alternatively, the two clauses can be divided into two complete sentences: 

"An increase in the price of seed caused an increase in the price of 
corn. An increase in the price of corn caused a decrease in the quan¬ 
tity demanded." 

Given the widespread availability of spell-checking software, there is no 
excuse for misspelled words in a paper. McCloskey (2000, 27) points out 
that spelling errors and typos make you “look like a careless dolt.” At best, 
readers will think you are careless. At worst, they will question your entire 
paper. Note, however, that the spell-checking software is not sufficient to 
catch all typos. For example, it will not find missing words. For that you 
need to review the document closely. Also, it is a good idea to have some¬ 
one else read tire final document. A friend is more likely to catch errors 
than you are, since you are (too) familiar with your writing. 

It is true that content is more important than mechanics in a paper. But 
in practice, failure to write well, that is, failure to use proper style and me¬ 
chanics, inhibits your ability to communicate. This is not “mere grammar” 
but rhetoric and argumentation, the art of using words to persuade. If you 
can’t communicate, you can’t persuade tire reader that your argument is 
correct. 
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SUMMARY 

• Economic wilting is writing that applies economic analysis to derive in¬ 
sights about an issue or problem. 

0 Writing a first draft is hard, even for experts. The key is to sit down, or¬ 
ganize your research materials, and do it. 

0 Well-written papers are focused, clearly organized, fully explained, pol¬ 
ished, precise, and concise. Additionally, they are free of grammatical 
and typographic errors. 

• Plagiarism means taking credit, even unintentionally, for the words or 
ideas of others. 

• No one can write a perfect paper in one draft. Good writers write many 
drafts, revising and revising again, until they get it right. 

MOTES 

1. Note that the humanities, since they are nonscience disciplines, do not 
use empirical evidence. 

SUGGESTIOMS FOR FURTHER READING 

Booth et al. (1995 )—Chapters 11-15 explain virtually every aspect of writing 
a college-level research paper, from pre-writing to revising for organi¬ 
zation and style. 

Harvey (2000 )—Excellent guide to writing that is very readable, even en¬ 
joyable. Harvey includes numerous excellent examples. 

McCloskey (2000 )—Classic monograph on wilting for economists. Read¬ 
able in one sitting. Originally published in 1987 under the title Eco¬ 
nomical Writing. 

McCloskey (1999) —-Four-page executive summary of McCloskey (1987). It’s 
better than nothing, but read the original. 

McCuen et al. (1993 )—Excellent guide to technical writing and speaking for 
engineers and other scientists. 

Thomson (2001 )—Useful guide for young Ph.D.s or others who are writing 
about theoretical economics. Quite technical, but other researchers 
may find it of value as well. 
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EXERCISES 

1. Find an example of “social criticism” written by an economist (e.g., Milton 
Friedman, Paul Krugman, Robert Heilbroner). Find an example of a re¬ 
search report written by the same author. In what ways are the two types 
of writing similar? In what w'ays are they different? 

2. Using the sources you have discovered so far on your research topic, 
draft a survey of the literature of no more than two pages in length. The 
focus of your survey should be to summarize the consensus in the liter¬ 
ature on your research topic. Briefly summarize the major studies that 
have contributed to this consensus. Be sure that your survey illustrates 
the first three features of good economic writing: focus, organization, 
and solid development of tire main points in tire survey. 

3. Write a clear thesis for your research paper. The thesis may well be a ten¬ 
tative one at this point. 

4. Find a paper you have written for another class. Underline the thesis of 
the paper. Is the thesis clear? If not, revise it to make it clearer. Con¬ 
struct a paragraph outline of the paper. Evaluate the outline. Is the out¬ 
line organized in the best way to lead logically to the thesis as a 
conclusion? If not, reorganize the outline. Are there any points that are 
extraneous? If so, delete them. Are all the major points in tire paper ex¬ 
plained fully and clearly? If not, correct them. 

5. Print a copy of the literature survey you wrote for Exercise 2, or another 
short paper. Circle every usage of the passive voice with red ink. Rewrite 
each using active voice. Compare the passive version with the active 
one. Next circle every nominalization with blue ink. Rewrite each using 
a verb to describe the actions. Print a copy of the revised paper and 
compare it to the original. Which do you think is clearer? Why? 





"The difficulty you have in reading 
doesn't necessarily reflect on your mental 
ability. Serious reading is hard work." 


SUSAN WISE BAUER 


|j| rofafgibnal writing in any discipline can be daunting to decipher. It’s 
r/ilpidrid of like your first course in a foreign language. Remember how 
•: lost you felt at die beginning? That feeling is completely under- 
ETirilstandable. Addressing tiiis issue for students, Schroeder et al. (1985) 
observe, ‘You are typically confronted with terminology and analytical 
techniques with which you are at best unfamiliar, or at worst totally igno¬ 
rant. There is no reason for you to feel badly about this state of affairs; as 
a neophyte in the world of research, you cannot expect mastery of all 
phases of research from the start.” 

Research papers tend to be written in a formal style that is designed to 
provide clarity and precise meaning in as concise a way as possible. This 
implies, however, that authors may leave out descriptions of common steps 
in the research process. This adds to the difficulty for novice readers, who 
are not (yet) members of drat particular research community. 

Fortunately, like a foreign language, once you learn the basic vocabu¬ 
lary and syntax, professional economics writing will begin to make sense. 
In other words, once you learn the “code,” you will find it much easier to 
decipher published research. 


! 
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Making Sense of Published Research 

There are at least two ways to drink about how to make sense of pub¬ 
lished research. We can call them format and argument. Let’s consider 
drem in order. 

Understanding Format 

What’s die difference between an essay you write for an English com¬ 
position class on what you did on your summer vacation and a lab report 
for chemistry? The answer is: specific format. The scholarly writing for 
each discipline uses a somewhat different format, witir tire physical sciences 
using die most structure (e.g., the lab report), die humanities using rela¬ 
tively less structure, and the social sciences lying somewhere in between. 
Even witiiin die social sciences formats vary, though relatively less than die 
formats of the sciences and the humanities differ. 

Economics research papers tend to follow a common format tiiat illus¬ 
trates the scientific metiiod. This specific format allows experts to look 
quickly at a paper to see if tiiey want to spend the time to read it more 
closely. For example, Thomson (2001, 2) states: 

A reader who has found your central point interesting and wants 
to know more about it but has little time to invest in your work . . . 
should be able to grasp the novel aspects of your [research] without 
actually reading the paper. A lot can be learned from a well-written 
argument by just glancing at the way it is structured. 

In economics, there are three types of scholarly works. The first is a sur¬ 
vey of the work of others. You can think of this as an entire paper devoted 
to reviewing the literature on a topic; its objective is to summarize for read¬ 
ers what is known on a subject to date. Survey papers, unlike the odier two 
types, are secondary literature. 1 We noted in Chapter 3 that the Journal of 
Economic Literature tends to publish survey articles. For example, the March 
2002 JEL includes an article entided “Looking Inside the Labor Market: A 
Review Article.” The second type of scholarly work in economics is the 
purely theoretical study. It creates or modifies a theory and discusses its im¬ 
plications, but contains no empirical testing. An example of a purely the¬ 
oretical article is Krugman’s (1979) seminal article on the possibility of 
governments effectively practicing strategic trade policy. The third type of 
scholarly work in economics—and die one we will focus on—is the em¬ 
pirical study. The majority of research projects in economics are empirical 
studies. This type offers a particular advantage, especially for beginning re¬ 
searchers: when you do an empirical study, whether your empirical testing 
confirms your hypothesis or rejects it, you have results tiiat enable you to 
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satisfactorily conclude your study. By contrast, when you attempt a purely 
dieoretical project, if you can’t get die theory to work die study has failed. 
For this reason, purely dieoretical works are best left to experienced re¬ 
searchers. 

Let’s examine die format for die typical empirical research study. The 
typical study in economics has four essential components: an introduc¬ 
tion, an analysis of the problem, an empirical test of die analysis, and a 
conclusion. An actual paper (or book) may have more formal sections, but 
to be credible to experts in the field die study, it must cover tiiese four 
components. Let’s examine each of tiiese in turn. 

The introduction should define die general topic and the specific re¬ 
search question, as well as explain the motivation for die research. The in¬ 
troduction should also include a review of the work of previous 
researchers on the topic, especially what is lacking in die existing literature 
and how the current study proposes to address that shortcoming. In a the¬ 
sis or book, die literature review is usually placed in its own chapter, sepa¬ 
rate from die introduction. In a paper it is more often combined with die 
other introductory material. 

The second section in an economic research report is the heart of any 
economics study: die application of economic analysis to shed light on die 
research question. It may be labeled “Theoretical Analysis” or “Theoretical 
Model.” (Exactly how one applies economic analysis to a problem is die 
subject of Chapter 7.) This section develops die dieoretical model used by 
die study, and derives the testable implications or hypotheses of die model. 

The tiiird section explains how the proposed analysis from section two 
is tested. (This will be the subject of Chapters 10 and 11.) This section 
could be labeled “Empirical Analysis,” or “Empirical Model.” It explicidy 
states what results would confirm the theory. It presents the results ob¬ 
tained from the testing procedures and interprets diem. To what extent is 
the theory confirmed? 

The concluding section in an economic research report explains die in¬ 
sights learned from the research. What answer did economic theory sug¬ 
gest for the research question? Was this answer confirmed by the empirical 
evidence? If not, why not? 

Economics studies may also include several odier parts, such as an ab¬ 
stract, references, and appendices, but diese four components are the 
principal parts of die work. 

Evaluating the Argument: Reading Critically 

Once you realize how a research report is laid out, you are part way to 
understanding it. To complete that understanding, you will need to learn 
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how to read deeply and critically. And this requires that you understand 
the article’s argument. 

Reading critically is challenging. The purpose of reading scholarly texts 
is not simply to get through tire pages while recording tire facts the author 
presents. Rather, the task is tire more complex one of discerning and eval¬ 
uating tire author’s argument. Mursell (1951, 58) describes this process as 
follows: 

When you read properly [i.e., critically], you are not merely assim¬ 
ilating. You are not automatically transferring into your head what 
your eyes pick up on the page. What you see on the page sets your 
mind at work, collating, criticizing, interpreting, questioning, com¬ 
prehending, comparing. 

To this end, scholars don’t merely read texts; they study them. First, 
they may skim an article to see if it is going to be useful to them, paying 
particular attention to certain sections. For example, some scholars focus 
on the introduction and the conclusion; others focus on the introduction 
and the empirical results section. If they decide the article may be useful, 
they read it again more carefully, often several times. Scholars read slowly, 
and try to discover the meaning. They attempt to interact with the text, 
essentially engaging in a “conversation” with the author. This conversa¬ 
tion takes place partiy in the reader’s mind, and partly through the notes 
the reader makes (e.g., in the margins of tire text) as he or she works 
through die text. (How to take research notes effectively is the subject of 
the last section of this chapter.) 

The text presents an argument. The reader responds by asking ques¬ 
tions and expressing tentative ideas about meaning: “Is this what you 
mean?” “But what about this?” “Okay, I understand die previous part, but 
I’m not sure I fully understand this one yet.” 

In Chapter 4, we explained how writers develop arguments through 
the process of composition. Readers analyze texts in the same way, by writ¬ 
ing ideas down, rearranging them, thinking about them, and asking ques¬ 
tions about die tentative meaning. The meaning becomes clear after you 
reread an article several times, sometimes over what may be an extended 
period. 

Bean (1996, 136) asserts that readers play two opposing roles in this 
process: “the open-minded believer who can succumb to the text’s power 
and die skeptical doubter who can find weaknesses in the text.” Since an 
argument is an assertion rather than a fact, and since, as a consequence, 
“every autiior necessarily distorts his or her subject,” healthy skepticism is 
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; NOTES FOR NOVICE RESEARCHERS 

Several Important Distinctions for Critical Reading 

Ruggiero (1998) suggests several important distinctions that are 
important for reading critically. The first distinction is between the 
person speaking and the idea he or she is expressing. (This is related 
to the ad hominem attacks we mentioned in Chapter 4.) Critical 
reading requires that you give a fair hearing to the ideas, even if you 
have a poor opinion of the author. The second distinction is between 
matters of taste (i.e., pure opinion) and matters of judgment. There are 
no grounds for criticizing the former but may be many for criticizing 
the latter. The third distinction is between fact and interpretation. 

Often, writers will present an interpretation as a fact so as to lend it 
more weight. Careful readers can distinguish between the two. The 
fourth is between literal and ironic statements. The latter are intended 
to provoke a strong reaction for rhetorical purposes. Fortunately, you 
are not likely to run into this very much in scientific writing. The last 
distinction is between an idea’s validity and the quality of its 
expression. In principle, you should evaluate the validity of an idea 
independently of how well or poorly it is expressed. In practice, 
however, the quality of written or oral expression almost certainly 
colors your impression of the validity of one’s thought. Note that this is 
a point raised by McCloskey (1998), as discussed in Chapter 1. 



called for (Bean, 1996,140). Careful reading of a text requires you to give 
adequate attention to both roles. Malting sense of scholarly writing is a lot 
like peeling an onion: it has many layers. You shouldn’t expect to discern 
all die levels of meaning without reading the work many times. 

Questions to Guide Critical Reading 

Though reading scholarly -writing is rarely easy, one’s ability to do it suc¬ 
cessfully does improve with practice. Critical reading is a learnable skill. 

Recall diat we defined an argument as an assertion supported by logi¬ 
cal or empirical evidence. Table 6.1 lists a series of questions to guide your 
reading and help you identify an author’s argument. Note that there may 
be more than one interpretation of the argument in a published work. 
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Tips for Getting Through a Scholarly Journal Article 

Novice researchers run into three common roadblocks that inhibit 
their ability to get through a scholarly journal article. These are: 
unfamiliar terminology or jargon, mathematical reasoning, and 
econometric issues and methods. Reading journal articles is difficult 
enough that the reader who confronts any of these roadblocks may be 
persuaded that he or she won't be able to understand the article. Don't 
let this happen to you. 

When you run across terminology or concepts that you don’t 
understand, write them down. Locke et al. (1998, 69-70) observe, "It 
may sound unlikely to you but we find that there are few instances 
when a single unfamiliar technical term brings reading to a complete 
halt. Just remember that in reading a technical text that is not your 
own, it is inevitable that there will be problems of comprehension. 

You have to puzzle them out or, failing that, flag them and get on with 
the task." 

Sometimes the jargon is explained elsewhere in the paper. 
Alternatively, you should try to look up the terms. An obvious 
place to look is your research methodology or econometrics 
book. Another useful source is Peter B. Meyers's "Online 
Glossary of Research Economics” terms, which is available at 
http://www.econterms.com. If all else fails, you should not 
hesitate to ask your instructor for help. 

Unfamiliar mathematics or econometrics should not prevent you 
from getting the gist of an article. Mathematical methods are used to 
develop a conclusion. You can think of them as "proofs." The 
mathematics shows the reader how the author arrived at the finding. 
But you don't need to understand the proof to grasp what was being 
proven. Almost certainly, if the paper is published someone will have 
checked the math to ensure that the proof works, so you don’t need to 
worry about that. What you may not realize is that many readers, 
including professional researchers, bypass the mathematical details 
the first few times they read a paper. 

These same points can be made about econometric issues. The 
purpose of statistical methods is to test hypotheses. Whether or not 
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you understand the particular techniques used, you should be able to 
determine if the results confirm or refute the researcher's hypothesis! 
Locke et al. (1998, 70) point out, "In any sound report, somewhere will 
be found a plain language description of anything found in the analysis 
that really mattered. . . .In many cases, that bit of text-should allow 
you to proceed with intelligent reading—even if not with full 
appreciation of the elegance (or appropriateness) of the researcher's 
statistical analysis. ” 


Identifying the Author's Argument 


1: What question is the author asking? 1 7 V Yfr. -..LL 

2. What answer does the author propose (i.e., what is the principal asser- 

tion of the study)?. ' ,7, k • ■ ;!' b ALA A-A : A, 77 T yA A 7777 

3. In what ways does the study improve upon previous research?. 1 

4 How does the proposed answer compare with that provided by previ¬ 
ous research? > 

5. What are the major logical or theoretical reasons for the author's 1 ' 

argument? : .7 • 77 77; ; v 7.;.7 7V : -A: yl 7^ A: 1 ■h 1 "/A 

6, What empirical evidence does the author provide? 

; 7.: What assumptions is the author making in his or her reasoning? 


Let’s work through a sample scholarly article from a professional eco¬ 
nomics journal. The article is David Romer’s “Do Students Go to Class? 
Should They?” (1993), which is reproduced on tire following pages. Read 
the article through at least once. 

Now, reread tire article carefully. As you read, think about tire questions 
in Table 6.1 and write down tentative answers as they come to you. (The 
answers that I provide for each question are exact quotations from tire ar¬ 
ticle to demonstrate where I found the answers. When you mate your own 
answers to these questions, you should put them in your own words. For 
the reasoning behind this, see tire section “Taking Research Notes and 
Writing Abstracts and Critical Reviews” later in this chapter.) 



chapter 6 ® Critical Reading or How to Make Sense of Published Research 


Do Students Go to Class? Should They? 

David Rattier 

Lectures and other class meetings are a primary means 
of instruction in almost all undergraduate courses. Yet al¬ 
most everyone who has taught air undergraduate course 
has probably noticed that attendance at these meetings is 
far from perfect. There is surprisingly little systematic evi¬ 
dence, however, about attendance and its effects. There 
are three natural questions. What is tire extent of absen¬ 
teeism? How much, if at all, does absenteeism affect learn¬ 
ing? Should anything be done about absenteeism? 

This article presents quantitative evidence on the first 
two of these questions, and speculative comments on the 
third. First, attendance counts in economics courses at 
three relatively elite universities indicate that absenteeism 
is rampant: usually about one-third of students are not at 
class. Second, regression estimates of the relation between 
attendance and performance in one large lecture course 
suggest drat attendance may substantially affect learning: 
considering only students who do all of tire problem sets 
and controlling for prior grade point average, the differ¬ 
ence in performance between a student who attends regu¬ 
larly and one who attends sporadically is about a full letter 
grade. In light of these results, steps to increase atten¬ 
dance, including making attendance mandatory, may de¬ 
serve serious consideration. 1 


1 I have been unable to find any previous investigations of the extent of 
absenteeism. There have been a few other studies of the relation be¬ 
tween attendance and performance (for example, Schmidt, 1983; Park 
and Kerr, 1990). These studies generally confirm tile findings here that 
attendance and performance are related even when a variety of student 
characteristics are controlled for. The present study differs from this 
earlier work in focusing on the quantitative magnitude of the relation¬ 
ship and on the issue of the extent to which the relationship reflects a 
genuine effect of attendance. 

E David Romer is Professor of Economics, University of California, 
Berkeley, California. 
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Do Students Attend Class? 

i Counts were made of the number of students attending 

one meeting of every undergraduate economics class dur- 
i ing a “typical” week of the spring 1992 semester at three 
• schools. School A is a medium-sized (6000 undergradu¬ 
ates) private university; School B is a large (20,000 under¬ 
graduates) public university; and School C is a small (2500 
undergraduates) liberal arts college. The schools are in¬ 
tended to be representative of the upper echelons of Amer¬ 
ican colleges and universities. All three are classified by 
Barron’s Profiles of American Colleges (1991 edition) as “highly 
! competitive,” the second highest of their six categories. 2 

The attendance counts were made a few weeks before 
the end of the semester at each school. This choice 
avoided both times when attendance is generally thought 
to be unusually low (such as just after exams and immedi¬ 
ately before and after vacations) and times when it is gen¬ 
erally thought to be unusually high (such as just before 
exams). Individuals at all three schools independently sug¬ 
gested that attendance a few weeks before the end of the 
semester was likely to be representative of average atten¬ 
dance. Attendance was taken at one meeting of each class 
during the sample week. Current enrollment figures were 
obtained from departmental offices. 

Table 1 reports the results. The first row shows the over¬ 
all absenteeism rates at each school. At School A, 34 per¬ 
cent of students were absent; at School B, 40 percent; and 

I _ 

3 

3 2 A total of 127 schools, enrolling about 675,000 undergraduates, are 

classified by Barron’s as “most competitive” or “highly competitive.” Of 
these, 24 schools, with 140,000 undergraduates, are private universities 
with between 4000 and 10,000 undergraduates; 14, with 270,000 under¬ 
graduates, are public universities with over 10,000 undergraduates; and 
31 56, with 100,000 undergraduates, are colleges or universities with mini¬ 

ma] graduate programs and fewer than 3500 undergraduates. The re¬ 
maining schools are small and medium-sized public colleges and 
universities (15 schools, with 70,000 undergraduates), large private uni- 
| versifies (5 schools with 70,000 undergraduates), and small private uni¬ 
's versifies (13 schools with 30,000 undergraduates). 


/3 - ; 3 
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Table 1 Absenteeism Rates in Economics Classes 


All Economics Courses 
By Size of Course: 

Small (bottom 33%) 

Large (top 33%) 

By Mathematical Content: 

Mathematical 
Non-Mathematical 
By Type of Course: 

Principles & 

Intermediate Theory 
Upper Level, Only 
Principles Required 
Upper Level, 

Additional Requirements 

at School C, 25 percent. 3 In short, on a typical day at a typ¬ 
ical elite American university, roughly one-third of the stu¬ 
dents in economics courses are not attending class. 

The remaining rows of die table break down die over¬ 
all figures along various dimensions. Course size appears 
to have an important effect on absenteeism. At all three 
schools, absenteeism is considerably lower in the smallest 
third of classes than in the largest diird. In addition, aver¬ 
age class size is lowest at School C and highest at School B, 
which is consistent with the fact that absenteeism is lowest 
at C and highest at B. Absenteeism is also lower in courses 
widi a significant madiematical component (such as 
econometrics, honors sections of intermediate dieory, and 
field courses in theory). This pattern holds at all three 


3 Attendance counts were inadvertendy not made in'a handful of classes 
at School C. These classes do not appear to differ in any systematic way 
from the classes at which attendance was taken. 


School A School B School C 


34.0% 

39.7% 

24.8% 

27.0 

37.7 

21.5 

38.8 

42.9 

30.4 

10.0 

17.6 

16.7 

34.3 

41.5 

25.5 

37.3 

40.5 

29.7 

26.4 

41.1 

17.7 

33.3 

35.2 

20.7 
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schools. 4 Similarly, at all three schools absenteeism is 
somewhat higher in core courses than in field courses. 

Finally, it is generally perceived, not surprisingly, that 
students attend class more often when the quality of in¬ 
struction is higher. At School B, for example, absenteeism 
is 34 percent for courses taught by regular faculty and 47 
percent for courses with other instructors. To investigate 
this issue more systematically, course evaluation data for all 
undergraduate economics courses for one term were ob- 
| tained from a fourth school, School D. This school, like 
School B, is a large public university. The two variables of 
interest are students’ average rating of the overall effec¬ 
tiveness of the instructor and the fraction of the students 
enrolled in the course who returned the course evaluation 
form (which is a reasonably good measure of attendance 
: at one of die last class meetings of the term). The point es¬ 

timates from a simple regression of die fraction of students 
i attending the class on tire average rating imply that raising 
the average rating from the 25th percentile to the 75th 
lowers absenteeism by 10 percentage points; the /-statistic 
on tire rating variable is 3.4. Thus the quality of instruction 
•l (or at least students’ perception of that quality) appears to 
have an important impact on attendance. 

Other features of tire data from School D generally con¬ 
firm the findings for the other schools. 5 Absenteeism is high 
; (45 percent across all courses), and lower'in small courses 

than in large (31 percent in the smallest third of courses and 
|j 54 percent in the largest third). Again, absenteeism is lower 


4 The figures for mathematical courses at School A are based on only 

| one course. Thus this figure should be given little weight. 

5 The data from School D are not strictly comparable with those from 

| the other schools, because they reflect class meetings at the end of the 

? term and because a few students are present but do not return the eval¬ 

uation form. It seems unlikely that these differences have any substan- 
2 dal effect on the results. 
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in courses with a mathematical emphasis (39 percent, versus 
47 percent for other courses), and higher in core courses 
(52 percent, versus 31 percent in field courses that only re¬ 
quire principles and 37 percent in advanced field courses). 

A straightforward regression confirms these patterns of 
differences in absenteeism across different types of courses. 
Specifically, using die data from all four schools, I ran a 
regression (across courses) with the fraction of students ab¬ 
sent as the dependent variable, and a constant, the log of 
enrollment, and dummies for mathematical content, for 
the two types of upper level courses, and for three of the 
four schools as independent variables. The resulting esti¬ 
mates imply that a doubling of enrollment is associated 
with a rise in absenteeism of 4 percentage points; that math¬ 
ematical content is associated with a fall in absenteeism of 
3 percentage points; and drat moving from a core course to 
either type of field course is associated with a fall in absen¬ 
teeism of 5 to 7 percentage points. The coefficient on the 
enrollment variable is highly statistically significant; those 
on the field course dummies are marginally so; and those 
on die dummy for mathematical content and the three 
school dummies are insignificant. 

Should Students Attend Class? 

These findings raise the question of whether absen¬ 
teeism has a substantial effect on learning. It is possible that 
students do not attend class because they would learn rela¬ 
tively litde if they did—because the instruction is of low qual¬ 
ity, or because they have already mastered the material, or 
because they can learn the material better by spending the 
same time studying in other ways. Alternatively, it is possible 
that learning is severely adversely affected by absences, but 
that many students are absent anyway—because they have 
genuinely better uses of their time, or because they mistak¬ 
enly believe that attendance is not important to learning, or 
because they attach relatively little importance to learning. 

Because student attendance is not exogenous—stu¬ 
dents choose whether to attend class—it is not possible to 
isolate definitively the impact of attendance on learning. 
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| 

But this section presents some suggestive evidence. In the | 
fall 1990 semester, I took attendance at six meetings of my 
large intermediate macroeconomics course. The resulting j 
data can be used to investigate the relation between atten- j 
dance and performance. ( 

As in other courses, overall absenteeism was high (25 | 

percent). Twelve percent of the students missed four or ; 
more of the meetings where attendance was taken; 28 per- ' 
cent missed two or three; and 59 percent missed none or I. 

one. Thus, absenteeism appears to be a mixture of some I 

students missing most classes and many students missing a | 
smaller number of classes. I 

Student performance is measured as the overall score on 
the three exams in tire course. For ease of interpretation, the 
scores are converted to tire usual 4-point grading scale: 3.84 
and above represents ah A; 3.50 to 3.83 an A-; and so on ; 

down to 1.50 to 1.83 for a C—. Because no D +’s or D —’s were ( 

assigned, 1.17-1.49 represents a D and 1.16 and below an F. f 

The first column of Table 2 reports the results of a simple 
regression of performance on the fraction of lectures at¬ 
tended. 6 The regression reveals a statistically significant and 
quantitatively large relation between attendance and perfor- : 
mance. The /-statistic on attendance is 6.2; the point estimates 
imply that a student who attends only a quarter of tire lectures 
on average earns a 1.79 (C—), while a student who attends all 
of the lectures on average earns a 3.44 (B+). Attendance 
alone accounts for 31 percent of tire variance in performance. ! 

——- ______-- j) : 

6 There is one econometric complication worth mentioning: because at- : 

tendance was not taken at ever)' class meeting, some of the variation l 

across students in measured attendance is due to measurement error ! 

rather titan to true differences in attendance over the whole semester. If | 

the class meetings at which attendance was taken were a random sample i 

of all the meetings—which appears to be a good approximation—it is 
straightforward to estimate the size of die measurement error. This pro¬ 
cedure implies that 38 percent of die variation in measured attendance 
represents measurement error. This estimate can be used to correct the 
regression coefficients, standard errors, and f? 2 ’s for the bias that would 
otherwise be introduced by the measurement error. All of die results re¬ 
ported in Table 2 have been corrected in this way. 
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Table 2 The Relationship Between Attendance and Performance 



(V 

121 

m 

141 

15) 

Sample: 

Full 

Restricted 

Full 

Full 

Restricted 

Constant 

1.25 

1.02 

1.07 

-0.67 

-0.78 


(0.27) 

(0.58) 

(0.23) 

(0.32) 

(0.43) 

Fraction of 

2.19 

2.47 

1.74 

1.52 

1.38 

Lectures 

(0.35) 

(0.70) 

(0.46) 

(0.32) 

(0.58) 

Attended 






Fraction of 



0.60 



Problem Sets 



(0.32) 



Completed 






Prior GPA 




0.78 

0.86 





(0.12) 

(0.14) 

Sample Size 

195 

116 

195 

195 

116 

n 1 

0.31 

0.26 

0.33 

0.47 

0.48 


Standard errors are in parentheses. The restricted sample consists of the students who 
completed all of the problem sets. 

Students who are more interested in the material, or 
more skilled academically, or more focused on academics 
are almost certain to attend class more often than students 
who are less interested, less skilled, or less focused (other 
factors held constant). If this is the case, then the results in 
Column 1 of Table 2 to some extent reflect a general im¬ 
pact of motivation on performance rather than a true ef¬ 
fect of attendance. 

I attempt to address this problem in three ways. First, I 
restrict the sample to the 60 percent of the students who 
did all nine problem sets. It seems likely that most of the 
students who were not devoting serious effort to the course 
did not complete all of the problem sets. In addition, the 
lowest problem set score was dropped in computing the 
course grade; thus the students who completed all nine 
may have been especially motivated. On both grounds, this 
restricted sample may be more homogeneous in terms of 
general motivation than the full class. But as the second 
column of Table 2 shows, the relation between attendance 
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and performance in this sample is actually slightly stronger 
than for the class as a whole. 

Second, doing the problem sets is arguably as good a 
proxy as attending the lectures for motivation. But Col¬ 
umn 3 shows that there is a much stronger relation be¬ 
tween attendance and performance than between doing 
the problem sets and performance: when both variables 
are entered in the regression, the coefficient on the frac¬ 
tion of lectures attended is almost three times as large as 
the coefficient on the fraction of problem sets completed. 
Thus, either attendance is a much better proxy than com¬ 
pleting the problem sets for motivation, or attendance has 
a large additional impact on performance. 

Third, data were obtained on students’ grade point av¬ 
erages as of the beginning of the semester. Including GPA 
as a control variable in the regression serves to control for 
some of the differences across students in general ability 
and motivation. In fact, because students’ academic per¬ 
formance in previous classes depends in part on their at¬ 
tendance in those classes, the coefficient on prior GPA will 
capture some of the effect of attendance on performance; 
as a result, including GPA as a control variable could cause 
the coefficient on attendance to understate the true impact 
of attendance on performance. 

Column 4 of Table 2 shows the effects of including 
grade point average in the regression. Prior GPA has an 
extremely strong relation with performance. But the in¬ 
clusion of GPA has little impact on the relation between 
attendance and performance. The coefficient on atten¬ 
dance is two-thirds as large as it is in the basic regression 
in Column 1, and it remains highly significant. The point 
estimates imply that a student with the mean prior GPA 
earns on average a 2.13 (C) if he or she attends a quarter 
of the lectures but a 3.27 (B+) if his or her attendance is 
perfect. 

Finally, Column 5 shows the results of both restrict¬ 
ing the sample to the students who did all nine problem 
sets and controlling for prior GPA. Even in this case, the 
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relation between attendance and performance remains 
large and significant. The estimates imply that a student 
with the mean prior GPA earns on average a C+ if he or 
she attends only a quarter of the classes, compared to a 
B+ if attendance is perfect. 

None of these ways of attempting to address tire prob¬ 
lem that attendance is not exogenous is definitive. 
Nonetheless, they all give similar results: simple ways of 
controlling for motivation and other omitted factors have 
only a moderate impact on the relationship between at¬ 
tendance and performance. Thus, although the possibility 
that the relationship reflects tire impact of omitted factors 
rather than a true effect cannot be ruled out, it seems likely 
that an important part of tire relationship reflects a gen¬ 
uine effect of attendance. 

Should Attendance Be Mandatory? 

Absenteeism is rampant in undergraduate economics 
courses at major American universities. In addition, there is a 
very strong statistical relationship between absenteeism and 
performance, and the evidence is consistent with tire view 
that this relationship has an important causal component. 

These results raise tire question of whether measures 
should be taken to combat absenteeism. At tire very least, ex¬ 
hortations to attend class seem called for, and those exhor¬ 
tations can be backed up with data. But stronger measures 
might be preferable. A generation ago, both in principle 
and in practice, attendance at class was not optional. Today, 
often in principle and almost always in practice, it is. Per¬ 
haps a return to tire old system would make a large differ¬ 
ence to learning. There is no way to find out but to try. I 
believe that tire results here both about tire extent of absen¬ 
teeism and its relation to performance are suggestive 
enough to warrant experimenting with making class atten¬ 
dance mandatory in some undergraduate lecture courses. 

One could also use mandatory attendance to perform a 
genuine controlled experiment that could isolate tire true 
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impact of attendance on master)' of the material. Specifi¬ 
cally, one could randomly divide the students in a course 
into two groups, an experimental group whose grading was 
based in part on attendance and a control group whose 
grading was not. By comparing the attendance and the 
performance of the two groups, one could learn both the 
impact of mandatory attendance on absenteeism and tire im¬ 
pact of attendance on performance. 7 Unless either the im¬ 
pact of mandatory attendance on absenteeism or the size 
of the class were very large, the results of carrying out this 
experiment for a single class would not allow one to esti¬ 
mate the impact of attendance on performance with much 
precision. But the pooled results from several such exper¬ 
iments could. 

B I am grateful to Caroline Fohlin, Matthew Jones, and Costas Tsatsaro- 
nis for excellent research assistance, and to Robert Cox, Roger Farmer, 
Steven Fazzari, Alan Krueger, Christina Romer, Paul Ruud, Joseph 
Stiglitz, Timothy Taylor, and Robert Turner for useful comments. 
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7. Such an experiment would presumably require appropriate approval. 
Students could be given the right to opt out of the experiment by being 
allowed to choose (before the class is divided into the experimental 
and control groups) to have their grade based on a formula that gave 
attendance half the weight used in the grading formula for the experi¬ 
mental group. Fairness could be ensured by assigning grades to all stu¬ 
dents using all three procedures (experimental, control, and opting 
out), and making the mean grade for the full class the same under all 
three procedures, before the allocation of the students to the three 
groups was known to the person assigning grades. 




As you go through the Romer article, ask yourself what role each part 
of tire text plays in tire author’s argument. For example, what is tire au¬ 
thor’s principal assertion? What are the major pieces of evidence he uses 
to support tire assertion? Note that, just as in many economics papers, the 
sections of Romer’s paper are not labeled “Introduction,” “Theory,” “Em¬ 
pirical Testing,” and “Conclusions,” but those pieces are in the paper 
nonetheless. 

1. What question is the author asking? This may be tire most important 
question in Table 6.1. If you can’t determine this, you are probably 
wasting your time reading the paper. The research question should 
be found in the introductory section of the paper. Romer’s paper is 
a complex argument consisting of three assertions, each of which 
leads logically to the next. He summarizes his argument in the open¬ 
ing paragraph, where tire key question is: 

"How much, if at all, does absenteeism affect learning?" 

2. What answer does the author propose? The author’s proposed an¬ 
swer, that is, the principal assertion of his or her argument, can of¬ 
ten be found in the introduction to the paper. Romer states this in 
the second paragraph: 

"Regression estimates of the relation between attendance and perfor¬ 
mance in one large lecture class suggest that attendance may substan¬ 
tially affect learning." 

If the author’s proposed answer isn’t in tire introduction, it should 
always be dearly stated in the concluding section of the paper. In the 
final section, Romer recapitulates: 

"U]here is a very strong statistical relationship between absenteeism and 
performance, and the evidence is consistent with the view that this rela¬ 
tionship has an important causal component." 

3. In what ways does the study improve on previous research? The in¬ 
troduction to tire paper should indicate how the current research is 
an improvement over its predecessors. In this case, the author indi¬ 
cates on page 1 (in a footnote): 

"The present study differs from . . . earlier work in focusing on the quan¬ 
titative magnitude of the relationship and on the issue of the extent to 
which the relationship reflects a genuine effect of attendance." 

Often, though not in this case, the distinctive features of the present 
research are also noted in the concluding section of the paper. 
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4. How does the proposed answer compare with that provided by pre¬ 
vious research? The answers provided by previous researchers are 
usually found in the literature survey in tire paper’s introduction, 
which in this case is limited to a footnote on page 1: 

"These [previous] studies generally confirm the findings here that atten¬ 
dance and performance are related even when a variety of student char¬ 
acteristics are controlled for." 

Additionally, the conclusion often indicates how the present results 
were either comparable to or different from those of previous re¬ 
searchers. 

5. What are the major logical or theoretical reasons for the author’s ar¬ 
gument? The author’s argument consists of the principal assertion 
(i.e., the author’s proposed answer to the research question), along 
with logical and theoretical reasoning, empirical evidence, and un¬ 
derlying assumptions. The theoretical reasoning can be found in 
the paper’s analytical section. Romer’s theoretical argument, since 
it deals with a topic we are all familiar with, is largely implicit: 

"Lectures and other class meetings are a primary means of instruction in 
almost all undergraduate courses." 

This is the first sentence of the paper. It implies that class attendance 
leads to learning. 

6. What empirical evidence does the author provide? Empirical evi¬ 
dence is found in the section of the paper on empirical testing. This 
includes the overall testing methodology, the results, and their in¬ 
terpretation. The first part of Romer’s empirical argument is to es¬ 
timate the extent to which absenteeism occurs: 

"In short, on a typical day at a typical elite American university, roughly 
one-third of the students in economics courses are not attending class." 

Romer then goes on to expand on this point: 

"Course size appears to have an important effect on absenteeism. . . . 
[Absenteeism is considerably lower in the smallest third of classes than in 
the largest third.... Absenteeism is also lower in courses with a significant 
mathematical component. . . . Similarly, . . . absenteeism is somewhat 
higher in core courses than in field courses. Finally, . . . students attend 
classes more often when the [perceived] quality of instruction is higher." 

In tire second part of Romer’s empirical argument, he reports on a 
series of regressions to estimate the relationship between attendance 



and performance in class when controlling for a number of outside 
variables, including students’ prior GPA and student motivation. 

"A simple regression of performance on the fraction of lectures attended 
. . , reveals a statistically significant and quantitatively large relation be¬ 
tween attendance and performance. . . . Simple ways of controlling for 
motivation and other omitted factors have only a moderate impact on the 
relationship between attendance and performance." 

7. What assumptions is the author making in his reasoning? Romer, like 
all researchers, makes a number of assumptions in his argument. 
These include that GPA is a good measure of student learning, that 
the results he observed at the “elite” universities he studied are gen- 
eralizable to all institutions of higher education, and that econom¬ 
ics students and courses are similar to other college students and 
courses. 

When you find it difficult to dissect a scholarly article, you may take 
heart from the following observation by Trelogan (2001): 

Why did it take so long to identify the arguments and to get clear 
about their premises and conclusions? The answer is that there is no 
mechanical procedure for doing this and that quite in general, read¬ 
ing—really reading—is a difficult art. We had to read carefully and an¬ 
alytically, and we had to do some serious thinking about the content 
and the structure of the passage we were examining. 

An additional complication, as a number of commentators have noted, 
is that many scholarly works are not well written. 

Evaluating Published Research 

Once you’ve identified the argument in a published work, the next step 
is to evaluate the argument, that is, to assess its validity and reliability. This 
is not a trivial task, so let’s begin with the easy questions. 

Does the author have an apparent conflict of interest? For example, is 
the research funded, sponsored, or published by an organization that 
might be considered less than objective about its findings (e.g., research 
funded by the tobacco companies that finds no adverse health effects 
from smoking)? A somewhat milder version of this concern: does the au¬ 
thor or the journal promote a particular point of view (e.g., liberal views, 
conservative views)? If so, you should factor this into your evaluation of 
the work. 

Is the study published in a refereed journal? We noted in Chapter 3 that 
scholarly articles published in professional journals are likely to have un- 
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dergone a formal evaluation process. Multiple experts in the field review 
the articles blindly, that is, without being aware of tire identities of the au¬ 
thors so as to keep their evaluations objective. 2 The more prestigious the 
journal, the more rigorous the evaluation, and tire more likely tire author’s 
argument is to be valid. Thus, articles published in tire American Economic 
Review are likely to be of higher quality and greater importance than arti¬ 
cles published in, for example, a regional journal. A similar review process 
takes place for academic books before they are published. Thus, when a 
study is published, novice researchers can assume that it has a certain 
amount of validity. 

Another tiring to remember is that tire usefulness of a published study 
depends on more than the ostensible quality of tire journal in which it is 
published. For example, James Tobin (1978) published an important, very 
widely cited article (“A Proposal for Monetary Reform”) in the Eastern Eco¬ 
nomic Journal, a relatively minor journal. By contrast, numerous articles 
have been published in top journals and were never cited again, or, if they 
were, they haven’t stood the test of time. In short, the ultimate validity of 
a piece of research comes only after time, when many other researchers 
have tested that theory in numerous different ways. Thus, when a scholar 
i eads a i esearch study, especially a recent one, he or she will need to criti¬ 
cally evaluate it for him- or herself. 

More Questions to Guide Critical Reading 

Table 6.2 lists a series of questions to help you evaluate an author’s ar¬ 
gument. Natui ally, evaluating a scholarly argument is inherently more dif¬ 
ficult than simply identifying it in a published study. Several of the issues 
discussed here will be dealt with in much greater detail in the remaining 
chapters of this book. Here, we just introduce them. 


Evaluating the Author's Argu ment 

1 Does the theoretical analysis make sense? 

2. Are the data used adequate to the task? 

3. Does the empirical methodology adequately test the hypothesis? 

4. Are the assumptions reasonable? 

5. Is the analysis (theoretical and empirical) clearly explained? 

6. Do the conclusions follow from the evidence presented? 

7. On balance, is the author's argument convincing to you? 

iim 
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As we discuss tiiese questions, let’s use them to assess Romer’s (1993) 
article. 

1. Does the theoretical analysis make sense? Do the reasons adequately 
lead to and support the hypothesis? Is die argument deep enough? 
Is the argument broad enough to be comincing? Are diere other 
reasons that need to be brought into die analysis? Is diere an alter¬ 
native explanation of the research problem diat makes as much 
sense as that proposed by the audior? 

We identified Romer's argument in the answers to questions 2, 5, 6, and 
7 in Table 6.1. Paraphrasing those answers gives us: 

"Lecture remains the primary pedagogical tool in undergraduate eco¬ 
nomics. Estimates of attendance in undergraduate economics courses at 
three top universities indicate that on average one-third of students are 
absent from class. Regression results on the effects of attendance on GPA, 
when controlling for motivation and student quality, indicate that atten¬ 
dance has a large and significant positive effect on student performance." 

This reasoning makes sense, since each point leads to the next and ulti¬ 
mately to the conclusion. The theoretical foundation—that class atten¬ 
dance should be important to learning—is logical. (Note, however, that 
valid reasoning doesn't rule out the possibility that there's a contrary log¬ 
ical argument—for example, that students could use the textbook to sub¬ 
stitute for class attendance.) 

2. Are the data used adequate to the task? Are the data completely doc¬ 
umented? Is die data set from an authoritative source? Are the data 
a representative sample or a special case? If die latter, are the results 
applicable to the situation you care about (e.g., does a study using 
Virginia data generalize to the United States as a whole)? 

The data on class attendance were taken during one meeting of a "typi¬ 
cal" week in all undergraduate economics courses at three "relatively 
elite " institutions. Though he did not identify the specific schools, Romer 
described one as a "medium sized private university," the second as a 
"large public university," and the last as a "small liberal arts college." 
More detailed data were obtained from another "large public university." 
Romer also pointed out several small shortcomings in the data. 

The data used to assess the relationship between class attendance and 
performance were obtained from Romer's intermediate macro course. 
The data on attendance were taken from a sample of six class meetings. 
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The data were adequately documented, but it is reasonable to question 
how representative they might be for undergraduate economics students 
at schools nationwide. On the other hand, it is important to recognize that 
data are never perfect. 

3. Does the empirical methodology adequately test the hypothesis? If 
you obtain the best possible results, how confident are you that the 
hypothesis is correct? Does the methodology clearly discriminate be¬ 
tween die hypothesis and alternative interpretations of the evidence? 

Subject to the possible shortcomings of the data just described, the re¬ 
gression analysis is a reasonable and commonly used methodology for 
testing questions such as the ones Romer posed. He attempts to control 
for motivation and student quality, which is an improvement over previous 
work. It is to his credit that Romer points out that his approach does not 
definitively deal with the problem that class attendance is not exogenous. 

4. Are die assumptions reasonable? If die assumptions are factual, are 
they true? If they are empirical (e.g., perfect competition) how crit¬ 
ical are they to reaching the author’s conclusion? If they are critical, 
are they reasonable? For example, are they commonly made in this 
literature? Do the data satisfy the assumptions of the testing method- 
ology (e.g., if the author uses an ordinary-least-squares (OLS) re¬ 
gression, are the requirements for OLS satisfied)? 

Most researchers would accept that GPA is a reasonable, if imperfect, 
measure of student learning. Romer's sample of data is a special case, 
and thoughtful researchers will be concerned about how representative 
it is. However, lacking evidence to the contrary, most would accept it 
provisionally. 

5. Is the analysis clearly explained? Does die description omit any steps 
that are necessary to reach the conclusion? (Recall that this may be 
a result of space limitations or instructions from the editors about 
omitting generally accepted warrants.) What is it that isn’t clear? Is 
it terminology? Are all tables and graphs discussed and explained? 
Ruggiero (1998, 65) indicates that well-written books or articles 
state their principal argument clearly and visibly. Then he observes, 
“Not every article or book, unfortunately, is well written.” 

Romer's article is well written, and clearly organized, even though it does 
not use the typical format for a scholarly article. The main points in his ar¬ 
gument are clearly explained; the tables serve to illustrate the argument. 





v*irnpici u - umiuai ncaumy ui i iuvv tu iviarvc ociiac ui ruunsiicu nestiarcn 


6. Do the conclusions follow from the evidence presented? Does the 
author correctly interpret the empirical results? How well does the 
empirical evidence support the conclusion? Does die author ad¬ 
dress conflicting evidence or views? Is there an alternative explana¬ 
tion drat fits the evidence? How do the results of this study compare 
witii previous ones? If diey are comparable, drat is a point in favor 
of this study. If drey are not, are convincing reasons supplied for why 
this is so? Such reasons might include that die current study has bet¬ 
ter experimental controls, a different data set, or more complete or 
o then vise better data. 

This paper is more empirical than theoretical. The regression results 
strongly support the assertions that a significant portion of students do not 
attend class regularly and that class absences adversely affect student 
performance. The conclusion that faculty should seriously consider 
stronger measures to encourage class attendance follows directly from 
this evidence. Additionally, the findings are consistent with earlier studies. 

7. On balance, is the author’s argument convincing to you? In light of 
the reasons and evidence provided, is die argument persuasive? In 
other words, is the supporting evidence consistent with the hypothe¬ 
sis? If so, die argument may be considered valid. Does the study actu¬ 
ally answer the research question (s) ? For example, was there anydiing 
you expected to see, but didn’t? Did die paper show signs of careless¬ 
ness, such as failing to provide full references to all citations? Are 
there significant studies on this topic that the author does appears un¬ 
familiar with? Did the audior identify limitations of the study? 

Romer's argument is very well done. We can conclude that the article is 
convincing, subject to the limits of its methodology. 

Though evaluating scholarly work requires that die reader make a judg¬ 
ment about its quality, Ruggiero (1998, 67) points out that it is not always 
necessary to agree or disagree in total with an author’s argument: 

If you agree in part and disagree in part, explain exactly what your 
position is and support it carefully. ... If some vagueness or ambi¬ 
guity in the author's argument prevents you from giving a flat answer, 
don't attempt one. Rather, say, "it depends," and go on to explain.. . . 

If you must deal with conflicting testimony and cannot decide your 
position with certainty, identify the conflict and explain why you can¬ 
not be certain. 

Evaluating published research is as much an art as a science, and we will 
continue to discuss it in the remainder of diis book. As Locke et al. (1998, 
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71) clearly state, “To be honest, it takes years of experience to quickly dis¬ 
cern flaws of logic and imperfections of analysis in a complicated investi¬ 
gation.” Undergraduates are sometimes reluctant to commit themselves 
definitively, especially when they feel tiiey don’t fully understand some¬ 
thing. You shouldn’t use either Ruggiero’s or Locke et al.’s points as an ex¬ 
cuse to avoid drawing a conclusion about a published work. 

Taking Research Notes and Writing Abstracts 
and Criticai Reviews 

Earlier in this chapter, we briefly discussed the note-taking process. 
Let’s consider it in more detail now. When you take notes on a reading you 
read it more carefully. The first time you read an article, you should focus 
on the big picture. Instead of taking notes on each paragraph, force your¬ 
self to read the entire section, then ask yourself: what did it say? Only after 
you can articulate the point of the section should you go back and take 
notes on the details. 

This suggests a related point. Good readers vary their speed and depth 
of reading to match their purpose—whether they are interested in getting 
the main ideas or all tire details. For example, as we noted previously, you 
don’t need to grasp all the nuances of the particular statistical technique 
an author uses to get a basic understanding of the results. 

The notes you take while reading a scholarly article are really addressed 
to you. After you finish tire article, it is a good idea to transcribe your notes 
onto paper (e.g., index cards) or a computer file. Always start by record¬ 
ing the complete bibliographic information using the appropriate citation 
style. You will be amazed how much harder it is to track down an article to 
obtain citation information the night before a paper is due. Save yourself 
the trouble and get it right the first time. 

Indeed, it is good practice to make copies of all articles (or chapters of 
books) that are important to your research project. Though this costs a bit 
for duplicating or printing fees, it is a small price to pay to avoid time and 
aggravation later. This is especially true for those articles you need to read 
more than once. When you copy an article, make sure you include the 
whole article, including references and appendices. You should also in¬ 
clude a copy of the title page of the journal, indicating journal title, vol¬ 
ume and issue number, and date. 

As a general rale, it is better to paraphrase what you read than to record 
it verbatim. Notes that are collections of direct quotations suggest that you 
don’t understand what you read or you haven’t made the effort to under¬ 
stand. Paraphrasing requires you to process the information, which means 
that you will learn it better. In addition, as you paraphrase you prepare die 



ideas for inclusion in your own writing. If you use a highlighter when you 
read, and you Find yourself highlighting a great deal of the text, it might 
be useful to make an effort to think more about what you’re reading and 
use tire highlighter less—for example, only on tire truly important aspects 
of the text. Finally, if you need help getting through an article, talk to your 
instructor. 

You can make tire notes you write on a scholarly work more formal by 
turning them into an abstract or a critical review. Both are natural prod¬ 
ucts of tire critical reading process. Most people think of an abstract as a 
summary of a scholarly article or book, but a better way to think of it is as 
a summary of tire author’s argument (together with complete citation in¬ 
formation). As such, as Cohen & Spencer (1993, 223) point out, an ab¬ 
stract may present the material in a different order than that of the 
original article. Scholarly works often include an abstract, but it is good 
practice to write your own. If you can do it well, you dearly understand the 
article or book. 

Let’s write an abstract for Romer (1993), the article we read critically 
earlier. Every abstract begins with complete bibliographic information for 
the work. You should consult your instructor (if your paper is an assign¬ 
ment for a course) or the style guide (if it is a submission to a journal) to 
determine tire correct citation style. The style used in Table 6.3 is Style B 
of the Chicago Manual of Style, Fourteenth Edition. 

Recall from Chapter 4 that an argument is a thesis supported by major 
theoretical and empirical reasons, which are themselves fleshed out by mi¬ 
nor reasons and other details. In an abstract of one hundred to two hun¬ 
dred words, you only have enough space to state the thesis and the major 
reasons. Based on the answers to the questions in Table 6.1, it should be 
easy to identify these. The thesis of the article is the answer to question 2 
in the table: What is the author’s proposed answer to the research ques¬ 
tion? The theoretical reasoning is the answer to question 5. The empirical 
evidence is the answer to question 6. Putting this all together and con¬ 
densing it yields the sample abstract in Table 6.3. 3 

A critical review is an abstract augmented with a critical evaluation of 
tire work. Thus, it includes complete citation information, a summaiy of 
the author’s argument, and an assessment of that argument. Such an eval¬ 
uation can be easily written by summarizing the answers to the questions 
in Table 6.2, as shown in Table 6.4. 

A related research product is called an annotated bibliography. This is 
a list of references that includes a few sentences summarizing and cri¬ 
tiquing each item. In other words, it is something like a collection of crit¬ 
ical reviews on a single research topic. An annotated bibliography is a good 
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Romer, David. 1993. Do students go to class? Should they? Journal of 
Economic Perspectives 7 (Summer): 167-174. 

Estimates of attendance in undergraduate economics courses at 
three top universities indicate that on average one-third of students 
are absent from class. Regression results from a large intermediate 
macro course of the effects of attendance on GPA, when controlling 
for motivation and student quality, indicate that attendance has a 
large and significant positive effect on student performance. Romer 
argues, as a consequence, that faculty should seriously consider 
ways to increase student attendance. 


Romer, David. 1993. Do students go to class? Should they? Journal of 
Economic Perspectives 1 (Summer): 167-74. . , 

Estimates of attendance in undergraduate economics courses at 
three top universities indicate that on average one-third of students 
are absent from class. Regression results from a large intermediate 
macro course of the effects of attendance on GPA, when controlling 
for motivation and student quality, indicate that attendance has a 
. large and significant positive effect on student performance. Romer 
argues, as a consequence, that faculty should seriously consider : 

\ ; ways to increase student attendance. ; A? 

The theoretical reasoning in this paper makes sense The empirical 
methodology seems appropriate, and the results are consistent with 
the hypothesis. The principal shortcoming is the possibly nonrepre¬ 
sentative data sample. On balance, however, the article is convincing, 
subject to the limits of its methodology. Any researcher doing work 

in this area should be familiar with this article. 


way to determine how complete your literature review is. It is also very use¬ 
ful for preparing the written literature survey for a research report. The 
Suggestions for Further Reading at the end of each chapter in this book 
are examples of annotated bibliographies. See also the annotated list of ar¬ 
ticles in Appendix 6A. 

The only way to learn how to read and understand scholarly research is 
to practice this important skill. Reading through diis chapter is not 
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enough! The more you practice, the sooner you will become adept at read¬ 
ing critically..To that end, in addition to the paper reproduced later in this 
chapter, another paper is also included on the website for your review. 
Please be sure to read it as well. 

SUMMARY 

o Scholarly literature is challenging to read. 

e Scholarly writing follows a specific format like a lab report. 

o You must read critically to discern and evaluate an author’s argument. 

« Critical reading requires judgment on the reader s part. 

© A research abstract is a summary of an author s argument. 

• A critical review is an evaluation of an author’s argument. 


NOTES 

1. There is another type of scholarly work, called a meta-analysis, that is 
similar to a survey paper but includes original research. A meta-analysis 
uses sophisticated statistical techniques to analyze the separate results 
of previous studies. It does this to discern insights that may not be ap¬ 
parent from any individual study, similar to the way a mathematical av¬ 
erage provides a single measure that summarizes a broad sample of 
data. Meta-analyses are much more than mere averages, however, and 
lie outside the scope of this book. 

2. Note, however, that not all journals have a peer-review process. If an ar¬ 
ticle is peer-reviewed, it is probably of high quality. If an article is pub¬ 
lished in a non-peer-reviewed journal, it may still be of high quality, but 
you can’t assume that as easily. 

3. Note that since the answers to the questions from Table 6.1 were exact 
quotations from the article, I also paraphrased to produce the abstract 
shown in Table 6.3. 

SUGGESTIONS FOR FURTHER READING 

Bauer (2003 }—A guide to “the great books,” the first four chapters provide 
an excellent discussion of how to read scholarly texts critically. 

Bean (1996)— Chapter 8, “Helping Students Read Difficult Texts,” is ad¬ 
dressed to instructors teaching critical reading, but students can bene¬ 
fit from it as well. 




Locke et al. (1998) —Monograph on understanding scholarly writing. The 
best parts are Chapter 4 on reading research reports, Chapter 5 on tak¬ 
ing notes, and Chapter 7 on critical reading per se. 

Ruggiero (1998) —Introduction to critical thinking. Chapter 4 provides 
thoughtful suggestions for critical reading. 

Schroeder et al. (1985) —Useful survey of how to understand scholarly arti¬ 
cles in psychology, but applicable to studies in the social sciences more 
generally. 

Trelogan (2001 )—Helpful web-based introduction to critical thinking and 
reading. 

Wyrick (1994) —Chapter 9, on the economic content of published re¬ 
search, is quite good though it emphasizes the format of economic writ¬ 
ing, rather than the argument. 

EXERCISES 

1. Find a scholarly article on your research topic. Read it critically, and us¬ 
ing the article write down answers to the questions in Tables 6.1 and 6.2. 

2. Write an abstract for the article you used in Exercise 1. 

3. Write a critical review of the article. 

4. Choose two articles on the same topic but with conflicting conclusions. 
Write a critical review of the two articles, and explain which article’s 
conclusion you found tire most persuasive and why. 

5. Prepare an annotated bibliography for your research topic. The bibli¬ 
ography should include at least three sources: an article from a schol¬ 
arly journal, a book, and an article from the Internet. Each item should 
have the three elements required for a critical review: 

® The complete bibliographic citation in the citation style required by 
your instructor; 

® A brief summary of the argument; please note if the source is theo¬ 
retical, empirical, or a survey; and 

6 An assessment of the quality of the item. 
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Reading and Evaluating a 
Theoretical Article 


Reading and understanding a theoretical journal article is a little dif¬ 
ferent than doing so for an empirical article. Typically, in a theoretical 
article, the analysis is more abstract or more in depth, which makes read¬ 
ing and understanding even more challenging. Before continuing, you 
may wish to read Chapter 7 for help in understanding theoretical analy¬ 
ses. Try reading the following theoretical articles, which I have anno¬ 
tated. In recent years, purely theoretical articles have become highly 
mathematical. For that reason, the following selections are taken from 
earlier decades. 


Sample Theoretical Journal Articles 

Rrugman, Paul. 1979. Increasing returns, imperfect competition, and in¬ 
ternational trade, Journal of International Economics 9 (November): 
469-479. 


This article demonstrates that international trade can result from 
economies of scale, even when both countries have identical prefer¬ 
ences and factor endowments. This article offers much to commend it 
to researchers new to theoretical studies. First, it is well written; even the 
analytical parts are clearly explained verbally. Second, though the de¬ 
tails of tire analysis require an understanding of constrained optimiza¬ 
tion using calculus, the analysis itself should be readily understandable 
to anyone who has taken intermediate theory. More precisely, the analy¬ 
sis consists of utility and profit maximization. Finally, the article is short, 
less than ten pages long. 

Becker, Gary. 1962. Irrational behavior and economic theory, Journal of 
Political Economy IQ (February): 1-13. 

This article established that maximizing behavior is sufficient but not 
necessary for obtaining downward-sloping demand curves. Indeed, sev¬ 
eral forms of irrational behavior also lead to downward-sloping demand 
curves as long as economic agents are subject to a budget constraint. 
The article uses graphs and logic but no higher math. 
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Tobin, James. 1956. The interest elasticity of tire transactions demand for 
cash, Review of Economics and Statistics '38 (September): 241-247. 

This classic article demonstrated that the transactions demand for 
money is a function of interest rates as well as income. Previously, only 
the speculative demand for money was thought to be interest sensitive. 

Leibenstein, Harvey. 1950. Bandwagon, snob, and Veblen effects in the 
tlr eory of consumers’ demand, Quarterly Journal of Economics 64 (May): 
183-207. 

This study makes a graphical and verbal argument for nontraditional 
effects of price on quantity demand—for example, that high prices can 
induce consumers to buy because of the sirob effect, which is shown by 
a shift in the demand curve. 
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“Man, because he is endowed with 
reason by which he is able to perceive 
relationships, sees the causes of things, 


understands tne reciprocal nature ot 
cause and effect, makes analogies, easily 
surveys the whole course of his life, and 
makes the necessary preparations for 

its conduct." 
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n this chapter, we explore how scholars perform the theoretical or 
conceptual analysis of economic research. The result is often called 
' J the theoretical model of one’s research. As we observed in Chapter 
;32S2, in a real sense this is the crux of the research project: where the¬ 
ory is applied to shed light on the problem being studied. Many scholars 
have noted that developing the conceptual framework is difficult for 
novice researchers. For one thing, it is the most abstract aspect of die re¬ 
search process, requiring both analysis of the research issue or problem 
(in the sense of taking it apart to understand it) and synthesis of an ap¬ 
propriate conceptual framework to explain it. It also requires that schol¬ 
ars have sufficient knowledge of die appropriate economic theory on 
which to build. Let’s see if we can explain it. 






The underlying theme of this chapter is what it means to “apply” theory 
to a research topic. We will begin by discussing what theorizing means in 
general terms. We then explain tire range of ways in which researchers the¬ 
orize, from narrative reasoning to mathematical reasoning. We also pro¬ 
ride examples to make dris difficult topic as concrete as possible. Next, we 
present a shortcut that scholar's commonly use: modifying an existing the¬ 
oretical model rather than creating one from scratch. Finally, we identify 
tire characteristics of a good research hypothesis, which is the product of 
tire theorizing process. 

What Does It Mean to Apply Theory 
to a Research Topic? 

Air economic research project is one in which economic theory is ap¬ 
plied to an issue so as to derive insights about it. What exactly does it mean 
to “apply” economic theory to air issue? Recall the major theories you 
learned in your principles of economics classes. On the micro side, the ma¬ 
jor theories include supply and demand, production and cost, the theories 
of the firm, and consumer behavior. On the macro side, the major theo¬ 
ries are aggregate demand and supply, consumption expenditure, invest¬ 
ment demand, and money demand. When you apply a theory to an issue 
you ask yourself, “Can this question be expressed in terms of one of these 
theories?” In other words, you ask whether your topic is related to de¬ 
mand, supply, production, and so on. 

Let’s start with a straightforward example. Suppose your research ques¬ 
tion involves retail diamond jewelry sales. Can you articulate this as an ex¬ 
ample of tire theory of demand? The theory of demand says that the 
quantity demanded of some item is influenced by tire price of the product; 
by tire consumer’s income, tastes, and preferences; and by the prices of re¬ 
lated goods. If we treat diamond jewelry sales as reflecting tire demand for 
diamond jewelry, what would be the analogous explanatory variables? 
These might include diamond prices, income, wedding engagement rates 
as an indicator of tastes, and prices of other types of jewelry. We can con¬ 
clude that the theory of demand can likely be used to analyze retail dia¬ 
mond jewelry sales. Note that whether tire theory actually works in practice 
will be determined by empirical testing. 

What Is Theorizing? 

If after you’ve given it some thought, it is not obvious what economic 
theory should be applied to your research, you may need to do some for¬ 
mal theorizing about your topic. Theorizing is tire process of brainstorm- 
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Applying Economic Theory 

As you review the literature for your research project, you should 
ask yourself for each study that you read what economic theory is 
being applied. 


ing about an issue so as to identify tire logical connections that explain the 
issue. The result of tire process is a theory that analyzes the research ques¬ 
tion, and in particular provides an answer to the question in the form of 
the research hypothesis. 

When you theorize, you ask three questions: 

1. What are the essential concepts involved in the problem being 
researched? 

2. How are the essential concepts related? 1 

3. What implications or predictions can be drawn from these rela¬ 
tionships? 

Theorizing involves constructing a conceptual or theoretical argument. 
As we noted in Chapter 4, this involves deductive reasoning. As you con¬ 
struct this argument, you will need to think both deeply and broadly about 
the concepts. Let’s work through a simple example, and then we’ll look in 
more detail at tire process. 

Suppose we are investigating the extent to which the U.S. macroeco¬ 
nomic slowdown was caused by the decline in the stock market. How do 
we “conceptualize” dais research question? What are tire major concepts 
involved? The initial two concepts are tire economic slowdown and the de¬ 
cline in the stock market. What does economic th eory suggest might cause 
a slowdown in the economy? One answer would be a decrease in aggregate 
demand. What could cause such a decrease? Since aggregate demand is 
tire sum of consumption expenditure, investment expenditure, govern¬ 
ment expenditure, and net export expenditure, a decrease in any of those 
components would cause a decrease in aggregate demand. Let’s consider 
how each of these might be related to the other major concept—a decline 
in tire stock market. Theory tells us that though consumer spending is pri¬ 
marily dependent on disposable income, it also depends on household 
wealth. If a decline in the stock market makes individuals feel less wealthy, 
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Mathematical Reasoning 


it may well cause a decrease in consumer spending. A decline in the stock 
market could have both a direct and indirect effect on investment spend¬ 
ing. To the extent that businesses obtain investment financing by issuing 
shares of stock, a decline in die market makes that more difficult. That is 
die direct effect. The indirect effect may be even stronger. If the stock mar¬ 
ket decline reduces consumer spending, it also reduces the demand for 
business output and dierefore business investment. 

Let’s restate what we have just described. 

First, we asked, given the major concepts in die problem, what eco¬ 
nomic theory or dieories can shed light on them? In this case, it was theo¬ 
ries of consumer spending and investment demand. Then, we asked, how 
could one apply those theories to the current research problem? Can we 
determine a logical connection between the stock market decline and the 
macroeconomic slowdown? Yes. 

What are die implications of that hypothetical story? In other words, 
what does diis theory predict? If die logic is correct, then a decline in the 
stock market should cause an economic slowdown as a result of reduced 
investment and consumer spending. This implication can be tested. 

Notice the distinction between existing economic theory and the re¬ 
sulting theory that is developed for a specific research project. In the cur¬ 
rent example, the existing theory is the theory of consumer spending; in 
the earlier example, it is the theory of demand. In die current example, 
die resulting tiieory is die effects of the stock market on consumer spend¬ 
ing; in the earlier example, it is die demand for diamond jewelry. In each 
case, the specific theory is an application of the existing, more general the¬ 
ory. 2 For example, any introductory economics student knows that the 
general theory of demand postulates a negative relation between price 
and quantity demanded. For the specific research project described ear¬ 
lier, the relevant question could be, how sensitive is the demand for dia¬ 
mond jewelry to changes in the price? 



Narrative Reasoning 

You can carry out a tiieoretical analysis in a variety of ways, from the less 
abstract to the more abstract. At one end of the spectrum is die use of a 
narrative reasoning process. At the other end of die spectrum is the use of 
formal madiematical methods. Let’s examine these different approaches. 

Remenyi et al. (1998, 129) argue that “when one attempts to develop 
models of die world, these start as narrative descriptions within which the 
imagination is allowed to range freely and widely over many possibilities.” 
The way to begin this process is by creating what they call a “primary nar¬ 
rative,” diat is, a document that gives a detailed description of the research 



topic, based on everything die researcher has been able to find out on die 
subject. The next stage is “concept creation,” in which you review the pri¬ 
mary narrative so as to identify die essential concepts. The tiiird stage is to 
write a “higher order narrative,” a revised version of die primary narrative 
diat focuses on the concepts identified in die previous steps and ignores 
secondary details. The next stage is to examine the rewritten narrative to 
identify possible relationships between die concepts. The final step is to 
postulate hypotheses from die dieoretical relationships. Though brain¬ 
storming is die strengdi of die narrative approach, Remenyi et al. observe 
diat “The narratives are of course constrained by currentiy accepted 
knowledge and theories.” 

Though it isn’t necessary to follow this exact procedure, it does give you 
some useful ideas about how to proceed. Otiier scholars have suggested 
some additional questions to consider as you conceptualize your research. 
For example, Williams (1984) proposes that scholars think about the issue 
or problem behind die research question, about what is known about die 
sources of die problem, and about what variables are relevant to die analy¬ 
sis of die problem. Ednidge (1995) reminds us to diink about how otiier 
researchers have conceptualized problems similar to yours. 

Mathematical Reasoning 

Theoretical economists have traditionally used mathematical reasoning 
to conceptualize tiieir research, and specifically to formulate hypodieses. 
Indeed, die main behavioral relationships familial' to students, including 
demand curves, supply curves, and production and cost functions, were all 
derived matiiematically. Though we differentiate between madiematical 
and narrative dieorizing, one can make a case that all research begins at a 
narrative level. Highly abstract dieories tend to start life as “stories” prior 
to being formally fleshed out in terms of mathematics. 

Chapter 4 defined deductive reasoning as reasoning diat starts from 
one or more assumptions and derives a specific conclusion from them. 
This suggests a two-step process, as follows. First, identify the relevant eco¬ 
nomic assumptions for die problem at hand. This is analogous to die first 
diree stages in die narrative approach discussed earlier. Second, use math¬ 
ematics to manipulate die assumptions so as to derive a conclusion or hy- 
podiesis. This is comparable to die last two stages in the narrative 
approach. Let’s illustrate this process widi a series of examples. 

Example 1: Optimizing Models 

Economists employ two types of approaches to madiematical dieorizing. 
The first approach is called optimizing models, and the second is called 




Mathematical Reasoning 




ad hoc models. Optimizing models are based on the assumption that the 
behavior of economic agents is driven by their attempts to maximize util¬ 
ity or profits or to minimize costs. 

Suppose our research question asks what factors determine how much 
output Acme Widget Company will supply? We begin by assuming that 
Acme will choose tire level of output that will maximize profit. We know 
that profit is the difference between revenue and costs. Suppose that 
Acme’s revenue and cost functions are the following: 

Total Revenue = P*Q 

Total Costs = 4 * Q 2 

Thus, our problem has four assumptions: 

1. The assumption that Acme will choose the level of output to maxi¬ 
mize profits; 

2. The definition of profit as total revenue minus total cost; 

3. The definition of total revenue as P * Q and 

4. The definition of total cost as 4 * Q 2 

The next step is to mathematically manipulate the assumptions to de¬ 
duce a conclusion. 

By substituting the definitions of total revenue and total cost into the 
definition of profit, we obtain Acme’s profit function: 

Profit = P * Q - 4 * Q 2 or Profit = f [ Q ] for short. 

This profit function typically has tire shape of a parabola or “hilltop.” If we 
were managing Acme, we could experiment with different levels of Q to de¬ 
termine the output level with the highest profit. We can more systematically 
solve this type of optimization problem by using calculus. Conceptually, we 
wish to find the “top” of the profit function, where the slope of die profit func¬ 
tion is zero. We know that mathematically we can find the slope of a function 
at any point by calculating the “derivative” of the function. If we want to know 
where the slope equals zero, we can set the derivative equal to zero and solve 
for Q. In this case, the slope of the profit function, dProfit/dQ = P — 8Q. 
Setting this equal to zero and solving for tire desired Q yields: 3 

0 = P - 8Q or Q = .125 * P 

Thus, our mathematical operations on tire problem lead us to tire con¬ 
clusion that Acme’s supply curve should be air upward-sloping function of 
the price of Acme’s product. This conclusion is based on the positive co¬ 
efficient ( + .125) on P. This is a testable hypothesis, derived by manipulat¬ 
ing the assumptions of tire model. 





Example 2: Ad Hoc Models 

Ad hoc models are models drat are not derived from optimizing prin¬ 
ciples. Rather, tire hypothesized relationships come from common sense 
or experience. The following is air example of a hypothesis that is derived 
by manipulating an ad hoc model. 

Suppose tire Bush Administration has proposed an economic stimulus 
package of $100 billion worth of spending increases. Our research ques¬ 
tion asks: What effect will tire stimulus package have on GDP? Will it be suf¬ 
ficient to correct the slowdown? Let’s assume drat the U.S. economy can 
be modeled by a simple Keynesian model consisting of three equations: 
Air identity to define aggregate demand: AD = C + I + G 
A behavioral equation to define consumption expenditure: 

C — a + b * Q 

Air equilibrium condition: AD = Q 

The consumption function is what makes this an ad hoc model. We did 
not obtain this equation as a result of trying to maximize or minimize some 
function. Rather, we slapped the optimizing and, based on experience, 
simply proposed that consumption expenditure should be positively re¬ 
lated to income or GDP. We further assume that investment expenditure 
is given and that the parameters of the consumption function (aand'6) are 
known. 

Thus, we have five assumptions: 

1. The assumption that the U.S. economy can be adequately modeled 
by three equations; 

2. The definition of aggregate demand as tire sum of G, I, and G; 

3. The assumption that consumer spending can be modeled by the be¬ 
havioral equation given above, where the parameters are known; • 

4. The assumption that investment expenditure is given; and 

5. The equilibrium condition. 

Using algebra, we can solve tire model for Q: 

Q= [1/(1-&)]*(« +I+ G) 

Remember that we wish to determine how tire increase in government 
spending (dG) will affect the level of GDP (dQ). Using some simple cal¬ 
culus, we can see that dQ = [1/(1 — £)] * dG, again assuming that a and / 
are constant so that da and dl are both zero. 

Suppose b, which represents tire marginal propensity to consume, has a 
value of 0.75. In other words, when an individual’s income increases by 
one dollar, he or she spends $0.75 more. Then, from the equation just 
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given, 1/(1 — b) equals 4. Thus, a $100 billion increase in government 
spending should, assuming this theory is correct, cause a $400 billion in¬ 
crease in GDP. This is a testable hypothesis that was deduced from tire 
mathematics. 

A Commonly Used Shortcut: 

Modifying an Existing Model 

Though it is always useful to go through the theorizing process just de¬ 
scribed, rarely do economists create an entirely original model from 
scratch. We mentioned in tire beginning of tire chapter that oftentimes re¬ 
searchers can readily apply a common economic theory to a research 
problem. These theories include demand and supply functions, produc¬ 
tion and cost functions, aggregate production functions, and consump¬ 
tion, investment, and money demand functions. 

Often researchers take an existing model, which has already been ap¬ 
plied to the topic they are interested in, and modify it in some way that 
seems an improvement over tire original. Recall from Chapter 3 that one 
purpose of tire literature survey is to avoid having to reinvent the wheel, 
and instead build on tire work of scholars who came before. The literature 
survey often shows what has been a helpful way to approach a research 
problem, as well as what hasn’t worked. So, starting with an existing model 
that was at least partially successful and then modifying that model to im¬ 
prove it is a time-honored approach. 

This can be done either by going through the formal optimizing 
process or by adopting a more ad hoc approach. The ad hoc way is ap¬ 
propriate if you know what tire outcome is going to be. For example, if you 
apply a very well-known theory, say, the theory of demand, it makes little 
sense to work through tire mathematics of utility maximization so as to de¬ 
rive a demand curve, since all economists know that tire result will be that 
quantity demanded should be negatively related to tire price. Let’s illus¬ 
trate this with two examples. 

Example 3: Intertemporal Utility Maximization with Money 

Economists going back at least to Irving Fisher have known that eco¬ 
nomic agents’ decisions about consumption versus saving can be analyzed 
as an application of intertemporal utility maximization. Indeed, this was 
tire idea behind Modigliani’s Life-Cycle Hypothesis for Consumption Ex¬ 
penditure. In that model, income in each period is allocated between two 
items: consumption, which provides immediate satisfaction, and saving, 
which earns interest and future satisfaction. 

aai 





A Commonly Used Shortcut: Modifying an Existing Model 


The Existing Theory: Consumption Today versus Consumption 
Tomorrow We can first look at utility as the choice between consumption 
today versus consumption tomorrow. 

U =/(C„, C a ) 

We can then maximize this, using tire Lagrangian technique, subject to 
the following budget constraint: 

MCo-Yo- (Yj-CjKl+r)) 

The following set of equations shows the maximization subject to con¬ 
sumption today, consumption tomorrow, and X. 

d\J/dC 0 = X 

au/aCj = x/(i + r) 

dU/d\ = Co - Y 0 - (Yi - Ca) (1 + r) 

We can look at the marginal rate of substitution here by tire following 
equation: 

(au/ac 0 )/(su/ac 1 ) = x/(x/(i + 0) = 1 + r 

Therefore, the choice between consumption today versus consumption 
tomorrow depends on the interest rate. 

For a specific functional form, we use a log Unear function, as follows. 

lnC 0 + InCa - X(C 0 - Y„ - (Yi - Q) (1 + r)) 

Maximizing this function, we get the following set of equations. 

au/ac 0 = i/Co - x 

au/aCj = l/Cj - \(i + r) 

aU/aX = Co - Y 0 - (Y x - Ca)/(1 + r) 

Solving the first-order conditions for Cq implies that consumption is 
proportional to wealth. 4 

C 0 = aW 0 

Modification of the Existing Theory: The Addition of Money Suppose 
we wanted to add another item to the basic model, namely, money. We 
would sketch out the basic model in the traditional way, but then add leal 
money to the utility function. 0 
U = /(C 0 , Q, M 0 , Mj) 

In accordance with our income equation the budget constraint will be: 

\(C 0 - Y 0 - AM 0 - (Yj - Ci - AM X )/(1 + r)) 

w 



Maximizing this function )delds the following set of equations: 
dU/dC 0 = X 
dU/dQ = X/(l + r) 
dU/AM 0 = -X 
SU/aAMj = x/(l + r) 

3U/SX = (C 0 - Y 0 - AM 0 - (Yj - Cj - AMj)/(l + r)) 

In addition to the marginal rates of substitudon found in Example 3, we 
also find the following: 

(dU/dC 0 )/(dU/dAM 0 ) = -1 
(au/aCjj/cau/aAMj) = 1 
(au/AM 0 )/(au/aAM 1 ) = — 1/(1 + r) 

We can see from this that the marginal rate of substitution between 
money today versus money tomorrow is based on the interest rate. 

As a result of this utility maximization function, we can see the following: 
C 0 = aW 0 
M d = bW 0 

In other words, money demand in any period should be proportional 
to total wealth, just as consumption is. 

Example 4: Ad Hoc Example 

The economics literature on the effects of parental work hours on chil¬ 
dren’s cognitive development typically applies the notion of an educa¬ 
tional production function. The idea is that just as a firm’s output is 
produced by combining capital, labor, and other inputs, one can tteat cog¬ 
nitive development as being “produced” by combining inputs of parental 
time and quality, family income, and so on. To date, the literature has not 
shown a consensus about whether maternal employment adversely affects 
children’s cognitive development. Suppose you hypothesize that the rea¬ 
son for the mixed results is the failure to control for tire quality of child¬ 
care parents use. It would be straightforward to add child-care quality as 
an additional input to tire educational production function, treating it an¬ 
alytically as any other input. 

What Makes a Good Research Hypothesis? 

The product of the theorizing process should be a research hypothesis. In 
Chapter 4, we noted that an argument could be considered both the princi¬ 
pal assertion and also the entire package of assertion and supporting eri- 
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NOTES FOR NOVICE RESEARCHERS 


Designing the Research Project 

Novice researchers often have difficulty determining how to turn a 
research problem into an original research design. An original research 
project does not require a truly original approach. Rather, it is common 
practice to modify an existing approach. This can be done in several 
useful ways. For example, we can take the same theoretical model as 
was employed in a previous study, but use: 

A different method of empirical testing, or 
A different data set (e.g., a different country, or a different time 
period). 

Alternatively, we could take a model that has been used to study 
some other topic and apply it to the topic we are interested in. Then 
again, we can modify the theoretical model in some way, as we have 
described in this chapter. For example, we could take Economist A’s 
basic model, but use Economist B’s method for doing some aspect of 
A’s model. 


dence. In the same way, one can think of the hypothesis and the theoretical 
explanation as the answer to tire research question. The hypothesis is the crux 
of tire conceptual or theoretical analysis. To assess a theoretical argument, we 
test the hypothesis. One of the most important factors leading to a successful 
research project is a good research hypothesis. Ethridge (1995) suggests that 
a good research hypodresis must have the following characteristics: 

1. It should be stated clearly and specifically in a way that can’t be mis¬ 
interpreted. 

2. It must be able to discriminate clearly from alternative hypotheses. 

3. It must be capable of being proved false. 

4. It should be empirically testable (and nontririal); that is, there must 
be reasonable statistical means and reasonable data available for 
testing it. 

5. It must be derived from tire theoretical analysis; otherwise, any sta¬ 
tistical results will lack validity, demonstrating at best correlation 
rather than the causation that we seek. Note that either an optimiz¬ 
ing or an ad hoc approach can be used here, as long as the hypoth¬ 
esis is linked to the underlying the 017 . 
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Exercises 


Let’s illustrate with the diamond jewelry example. The hypothesis can 
be stated clearly that the demand for jewelry depends negatively on tire 
price (Characteristic 1). The alternative hypothesis would be that the de¬ 
mand does not depend negatively on the price. This is a clear distinction 
(Characteristic 2). Either the hypothesis is true or false (Characteristic 
3). The hypothesis can be tested using regression analysis and data ob¬ 
tained from tire industry and public sources (Characteristic 4). Finally, 
as we showed earlier, the hypothesis is derived from the theory of de¬ 
mand (Characteristic 5). 

SUMMARY 

0 When you apply economic theory to a research question, you ask, “Can 
this question be expressed or interpreted in terms of a basic economic 
theory such as supply, demand, production, cost, and so on?” 

• Theorizing is the process of brainstorming about an issue to identify the 
essential concepts involved and the logical connections between those 
concepts. 

° The result of theorizing should be a hypothesis that provides a tentative 
answer to the research question. 

0 Economic researchers theorize using some combination of narrative 
and mathematical reasoning. 

0 Theorizing is an exercise in deductive reasoning—for example, where 
economic assumptions are manipulated mathematically to derive a pre¬ 
diction or conclusion. 

• It is common practice for researchers to modify an existing model in the 
literature rather than creating an entirely original model from scratch. 

MOTES 

1. Wyrick (1994) notes that the validity of these proposed relationships is 
strengthened if they draw on the findings of other researchers. 

2. In applying tire general theory, one might also ask if there are any in¬ 
stitutions or other real-world constraints that are assumed away in the 
theory but whose presence in the real world has a bearing on our analy¬ 
sis? If so, what are the effects likely to be (e.g., rent controls)? 

3. This result was obtained using the power rule: for Y = aX n , dY/dX = 
n*aX n ~~ 1 . 



4. See Miller et al. (1974), p. 95. 

5. This modification is from an undergraduate research paper, Fallen (2000). 

SUGGESTIONS FOR FURTHER READING 

Epstein and Kemberger (2005), Chapters 4 and 5 —Excellent introduction of 
die logic behind economic theories and models. Thoughtful examples 
fit right in with tire discussion in this chapter. 

Ethridge (1995), Chapter 8 —Provides a nontechnical explanation of the 
theorizing process. 

Remenyi et al. (1998) —Offers a detailed explanation of the narrative rea¬ 
soning process. 

Wyrick (1994), Chapter 10 —Provides another perspective on theorizing, ad¬ 
dressed to undergraduate researchers. 

EXERCISES 

1. What economic theory can be applied to tire following topics? 

° The effects of foreign direct investment on economic growth in de¬ 
veloping countries. 

0 Are automobile sales more sensitive to changes in price or financing? 

0 The effects of free agency on salaries in baseball. 

• Housing prices in die [fill in a city] area. 

2. Choose three studies from your literature survey that you found to be 
relevant to your research topic. Identify tire economic theory that each 
study applies to its research question. 

3. When you decide to apply an economic theory like supply or demand 
to your research as we described in the beginning of tire chapter, are 
you choosing an optimizing model or an ad hoc model? 

4. Example of using an optimizing model to theorize: Determine the demand for 
advertising by adding advertising as an input in the production func¬ 
tion and then minimize cost, subject to an output constraint. 

5. Example of using an ad hoc model to theorize: The macro economy has been 
modeled frequently using tire Cobb-Douglas production function: Q, = 
A*L a *K b , where Q = GDP, L = labor input, K= capital input, and A is 
technical change. This production function has great appeal, since the 
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c h a p t e 


"It is a capital mistake to theorize before 
one has data. Insensibly one begins to 
twist facts to suit theories, instead of 
theories to suit facts." 

SIR ARTHUR COMAN DOYLE 


ff| “ 

b * s i s the first of two chapters on locating and collecting economic 
hi data. In this chapter, we focus on how data are constructed and 
_ Jli where lhe > 7 ma 7 be found, both in primary and secondary sources. 

■. • bl the next chapter, we explain how to compile a data set for your 

own research project. 

A key part of any empirical research project is collecting and manipu¬ 
lating data. It is never too early to begin looking for potential data sources. 
After all, a research project can be hamstrung by inadequate data just as 
easily as by a lack of a clearly conceived, testable hypothesis. In fact, it is 
not uncommon for novice researchers to invest a great deal of time and ef¬ 
fort developing a research project only to discover that data aren’t avail¬ 
able to adequately test the hypothesis. Don’t let this happen to you. 

Even though this book describes the research process sequentially, in 
practice, many steps can be performed in parallel or at least in overlapping 
steps. Once you have settled on a research question, it is wise to begin look¬ 
ing for usable data. To begin, you should look for data in the general area 
of your research (e.g., macro data, international trade, financial data, 
etc.). As you review the literature on your topic, pay attention to the data 
sources previous studies have used. Think about their strengths and weak¬ 
nesses. Would they work for your research? 
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The Structure of Economic Data 


Once you have sketched out your conceptual or theoretical model, 
you should delve into your data search in earnest. What variables do 
you need data for in order to test your research hypothesis? Often¬ 
times, researchers must simplify their theoretical model because the 
data are inadequate. Either there are insufficient data points for one or 
more variables, or a specific variable they need is not available. Before 
we get into the details of developing your own data set, to be discussed 
in Chapter 9, we first need to examine the process of data creation and 
the structure of published data. 

Data Creation 

In Chapter 1, we noted that many people think that knowledge consists 
of facts that can be plucked from a tree in the forest, once that tree is dis¬ 
covered. By contrast, we have argued that knowledge is created; that 
knowledge is the interpretation of facts. 

In the same way, people tend to think of statistics as facts. This is mis¬ 
leading or wrong. The vast majority of data are constructed rather than 
collected. Government or private organizations that do a serious job of 
constructing statistics make decisions throughout tire process that affect 
the quality of that data for any given purpose. 

Steps in Data Construction 

Best (2001) identifies three steps in the construction of a data series: 

1. Defining the concept; 

2. Deciding how the concept will be measured; and 

3. Determining how to define the sample on which the data will be 
based. 


Let’s examine these steps to get a better sense of what published data 
really mean. 

Suppose we are interesting in finding income so we can test the theory 
of consumer behavior. What is die basic consuming unit? There are two 
commonly used possibilities: households and families. If we choose fami¬ 
lies, our concept of income will be average family income. If we choose 
households, it will be average household income. If you compare the two 
variables, tiiey will be similar but not the same. 

This illustrates an important point about data. Every data series is con¬ 
structed for a specific purpose. You should realize that tiiat purpose may 
not be the same as yours. As a consequence, a given data series may not be 
defined or measured in a way that best matches your needs. Unless you are 
going to construct your own data, you will have to live with this problem. 




However, you can minimize this problem by doing your best to discover ex¬ 
actly where the data come from and what they actually measure. 

Suppose we’ve chosen to define our variable as average family income. 
How is that to be measured? What will be included? Should we include la¬ 
bor income only, for example, wages and salaries, or capital income, for in¬ 
stance, interest, dividends, capital gains, too? Should we include in-kind 
payments, such as food grown on farms and consumed by the fanners? It’s 
not sufficient to decide you want “total” income. You still need to define 
what is included. 

How are we going to measure average family income? What measure of 
the average should we use? Unless the data are approximately normally 
distributed, which income data tend not to be, you can get substantially 
different values for the mean, median, and mode. Thus, for income data 
one tends to use the median rather than the mean as a measure of central 
tendency. 

Sample Data 

Most social science statistics are based on sample data rather than pop¬ 
ulations. Therefore, the data that are published are extrapolated from 
samples. For example, average family income is not, in fact, the average in¬ 
come of all families; rather, it is the average income of the families in the 
sample. 

Only if the sample is random, and thus truly representative of the pop¬ 
ulation, will tire sample statistics correctly measure the population. A truly 
random sample is difficult and expensive to acquire, so typically samples 
are less than truly random. As a consequence, bias creeps into the results. 
As such, you should think of all data as estimates rather than facts. 

Best (2001, 161) sums it up well: “Every statistic must be created, and the 
process of creation always involves choices that affect the resulting number.... 
No statistic is perfect, but some are less imperfect than others.” 

The Structure of Economic Data 

When we discussed surveying the literature on a research topic, we 
mentioned that knowing how the economics profession organizes its lit¬ 
erature can help scholars find information quickly and efficiently. The 
same thing can be said for data. 

It is important to differentiate between those organizations that collect or 
produce data and those that publish it. This is the same distinction as pri¬ 
mary versus secondary sources of information. Let’s talk first about the 
sources of data, and then we can figure out the best places for users to 
obtain that data. 






Statistics are not collected at random. Rather, they are typically the re¬ 
sult of a specific data collection effort or process. The product of this 
process is a specific data set that includes a collection of certain variables. 
The characteristics of data sets are described in the next section. The sec¬ 
tion that follows provides an overview of the various agencies that collect 
tire data and the major data sets they produce. Then, we will explain tire 
most important of those data sets hr detail. 

Characteristics of Data Sets 

Survey data comes in three forms: time series, cross-section, and longi¬ 
tudinal (or panel) data. Time-series data gives different observations or 
data points on the same variable at different points across time. For ex¬ 
ample, a time series could consist of annual data for U.S. GDP over the 
time period 1950-2000. Cross-section data, by contrast, gives different ob¬ 
servations of a comparable variable at the same point in time. For exam¬ 
ple, a researcher could obtain a cross section of data that consisted of 
average disposable personal income across the fifty states in the United 
States. Finally, longitudinal data take a cross-section sample and follow it 
over time. 1 For example, a sample of family income for the same ten fam¬ 
ilies over five years would be a longitudinal data set. Note that all three of 
these examples have a sample size of fifty observations, but the basis for 
these observations is different. Longitudinal data are an example of a mi¬ 
cro data set, meaning that the data points or observations are of individ¬ 
ual economic agents such as individuals, households, or firms. Contrast 
this with macro data compiled at the national level, which might measure 
national income or total expenditure by consumers. 2 

Time-series data are available in different frequencies. Frequency 
means how often the concept is measured. The U.S. Census is conducted 
every ten years. U.S. GDP is measured quarterly. Unemployment is avail¬ 
able monthly. Interest rates are available daily, and stock prices may be 
available hourly. Each of these variables is also available for longer time pe¬ 
riods. For example, U.S. GDP, unemployment, interest rates, and stock 
prices are available at annual rates, too. Figure 8.1 lists a selection of com¬ 
monly used time-series variables and the frequencies with which they are 
measured. 

Cross-section data have a comparable concept to data frequency that in¬ 
volves the “unit of analysis.” With time series data, the observations are 
taken at different points in time. By contrast, with cross-section data, the 
observations are measured across different nations, states, or other units 
of analysis. Suppose you need data for income. Average disposable per¬ 
sonal income, for example, can be measured in each of the fifty states. Al- 
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Figure s.i Available Frequency of U.S.Time Series Data 


Data Available Annually: 
Data Available Quarterly: 


Data Available Monthly: 


Data Available Weekly: 
Data Available Daily: 


Fixed Reproducible Capital-Fixed-Asset Tables 

Gross Domestic Product and Major 
Components— 

Advance Estimate, available with a 
1-month lag; Preliminary Estimate, 
available with a 2-month lag; Final Estimate, 
available with a 3-month lag. 

Corporate Profits 

Productivity Growth 

Employment Cost Index 

Flow-of-Funds Accounts 

U.S. International Transactions (complete 

balance of payments: current + capital 

account), available with a 2.5-month lag. 

Personal Income and Outlays, Disposable 
Personal Income, Personal Consumption 
Expenditures, available with a 2-month lag 
Retail Sales 

Employment and Unemployment Rates 

Consumer Price Indices 

Producer Price Indices 

U.S. International Trade in Goods and 

Services (i.e., current account), 1.5-month lag. 

Index of Industrial Production, Capacity 

Utilization Rate, 1.5-month lag. 

Consumer Credit, 5-week lag. 

Monetary Base, Monetary Aggregates 
Exchange Rates, Selected Interest Rate 


tentatively, you might be able to obtain family income data across all coun¬ 
ties in a particular state, or data on household income for sample house¬ 
holds in a particular county. The researcher’s decision about what 
frequency or unit of analysis to select can be guided by economic theory, 
but is often determined by data availability. 




Organizations That Collect and Publish Data 

A number of U.S. governmental, international, and private organiza¬ 
tions gather economic and social statistics. 3 These include the Census 
Bureau, the Bureau of Economic Analysis (BEA), the Bureau of Labor Sta¬ 
tistics (BLS), the Federal Reserve, the International Monetary Fund, tire 
World Bank, and the United Nations, as well as private organizations such 
as the Conference Board. Let’s survey some of these. 

Census Bureau 

The Census Bureau (www.census.gov) constructs general and popu¬ 
lation statistics, as well as foreign trade data on goods and services im¬ 
ported into or exported from the United States. An excellent place to 
start looking for general statistics is the Statistical Abstract of the United 
States. Though the Statistical Abstract may not have the specific data you 
need, the references (source information) for each of its tables can give 
you clues about where that specific data may be found. The Census Bu¬ 
reau publishes several other useful references, such as the State Met¬ 
ropolitan Area Data Book, USA Counties, and the County & City Data Book. 
The Census Bureau conducts the decennial U.S. Census of Population, 
which includes detailed socioeconomic data. It also conducts several 
lesser-known surveys, such as the Economic Census (every five years), the 
Annual Survey of Manufactures, Cunent Industrial Reports (annual/quarterly/ 
monthly), and the American Housing Survey. Finally, the Census Bureau 
conducts a number of surveys on behalf of other agencies (e.g., BLS). 
These will be discussed later. 

Bureau of Economic Analysis 

The Bureau of Economic Analysis (www.bea.doc.gov) constructs the 
major U.S. macroeconomic indicators. The BEA also produces estimates 
of private and public capital stocks (under the heading “Fixed Assets”), 
GDP by industry, U.S. balance of payments, U.S. international invest¬ 
ment position, and regional and state data. The BEA has several major 
data collections, most prominently the National Income & Product Accounts 
(www.bea.doc.gov/bea/dn/nipaweb). It includes detailed estimates of 
GDP and its components (e.g., personal consumption expenditure, in¬ 
vestment expenditure, government expenditure, and net exports), as well 
as price indices for each. The report also provides a detailed breakdown 
of national income. The BEA publishes the monthly Survey of Current Busi¬ 
ness (http://www.bea.doc.gov/bea/pubs.htm#SCB%20Table), which pro¬ 
vides notice of revisions of each of its products as well as detailed 
discussion of data sources and methodology. 
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Bureau of Labor Statistics 

The Bureau of Labor Statistics (www.bls.gov) constructs data related to 
employment issues, productivity, and prices (e.g., the Consumer Price In¬ 
dex, the Producer Price Index, tire Employment Cost Index, and compo¬ 
nents of each). The BLS produces a number of major data collections, 
which will be discussed later in this chapter, including tire Current Popula¬ 
tion Survey, Current Employment Statistics, the Consumer Expenditure Survey, as 
well as tire National Longitudinal Surveys. The BLS also publishes tire 
Monthly Labor Review, which plays a role for tire BLS similar to that of the 
Survey of Current Business for the BEA. 

The Federal Reserve 

The Federal Reserve (www.federalreserve.gov) constructs principally fi¬ 
nancial data: interest and exchange rates, money stocks and components 
(e.g., bank reserves), debt measures, bank assets and liabilities, official re¬ 
serves, household assets and liabilities, and corporate debt. These data are 
available at http://www.federalreserve.gov/rad.htm. The Fed also con¬ 
ducts the U.S. How of Funds Accounts and the Survey of Consumer Finances. 
Though much of the data from tire Federal Reserve System is produced by 
the Board of Governors (BOG), researchers should not neglect the data 
available from the regional Federal Reserve banks, especially if one is inter¬ 
ested in regional economic research. Some of these data are not available 
from dre BOG. For example, tire Adanta Federal Reserve bank computes a 
trade-weighted foreign exchange value for dre U.S. dollar, based on the 
fifteen-largest trading partners of dre United States. Also, the St. Louis Fed 
has an easy-to-access collection of U.S. macro data (called FRED II). 

International Agencies 

A number of international agencies also collect economic and financial 
data of use to researchers. These agencies include the International Mon¬ 
etary Fund, the World Bank, the Organization for Economic Cooperation 
and Development, EuroStat, the Asian Development Bank, and the Inter- 
American Development Bank. A number of United Nations’ agencies also 
publish data. Most of dris data, however, is available in electronic formats 
only via expensive subscriptions. However, it is fairly widely available in 
print formats in many college and university libraries. 

Major Primary Data Collections 

In this section, you can find more detailed explanations of the major 
data collections mentioned previously, as well as other major primary-data 
collections. A more comprehensive survey of the major data collections 
may be found in Maier (1999). 
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U.S. National Income and Product Accounts 

The National Income and Product Accounts (NEPA) are the official na¬ 
tional accounts (i.e., macroeconomic statistics) of the United States and can 
be found at (http://wvw.bea.doc.gov/bea/dn/nipaweb/index.asp). This is 
the source of all U.S. macroeconomic data. The accounts include real and 
nominal data on national income and product; personal income and ex¬ 
penditures; government receipts and expenditures at the federal, state, and 
local levels; foreign transactions; sating and investment; income and em¬ 
ployment by industry; and price indices for nearly all these categories. This 
data set offers an amazing amount of detail. For example, Table 2.6, “Per¬ 
sonal Consumption Expenditure by Type of Product,” indicates that U.S. 
residents spent $80.6 billion on higher education in 2000, but only $62.1 bil¬ 
lion on dental services. Annual data are available back to 1930, and quarterly 
data (for the major series) are available back to 1952. Much of tire data are 
available online and are easily downloadable in spreadsheet format. For ad¬ 
ditional information, see Maier (1999), Chapter 7. 

U.S. Flow of Funds Account 

The Flow of Funds Account (http://www.federalreserve.gov/releases/ 
Zl/) collects data on financial flows across tire U.S. economy. It includes 
detailed information on levels of and changes in assets and liabilities in 
each sector of the economy. Annual data are available back to 1945, quar¬ 
terly data are available back to 1952, and monthly data for the major com¬ 
ponents of domestic nonfinancial debt are available back to 1955. 

U.S. Balance of Payments Accounts and International 
Investment Position of the U.S. 

The international accounts of the U.S. (http://www.bea.doc.gov/bea/ 
dil.htm) are compiled in two sets of data that correspond to a flow account 
and a stock (or asset) account. The flow account measures exports and im¬ 
ports of goods and services (the Current Account) as well as capital in- and 
outflows (the Capital Account). These data are available quarterly for tire 
aggregate data that measure U.S. balance of payments vis-a-vis the rest of 
the world. Quarterly data are also available for U.S. international transac¬ 
tions with Western Europe, the European Union, the United Kingdom, 
Eastern Europe, Canada, Latin America, Japan, and Australia. Annual data 
are available for trade with Belgium-Luxembourg, tire Netherlands, Ger¬ 
many, France, Italy, Mexico, Venezuela, and South Africa. The aggregate 
data and some of the country data are available back through 1960. The 
stock account data, which are collected annually, summarize U.S, holdings 
of assets from and liabilities to the rest of the world. More detail is avail- 
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able (by country and industry) for data on U.S. direct investment abroad 
and foreign direct investment in tire United States. 

U.S. Census of Population and Integrated Public Use 
Microdata Series 

The U.S. Census (http://www.census.gov/main/www/cen2000.html) 
is the decennial measurement of the U.S. population. In addition to 
counting the total number of Americans, the Census collects a wide 
range of socioeconomic information that is disaggregated by location, 
from the national level all the way down to zip codes. The Minnesota 
Population Center’s Integrated Public Use Microdata Series (IPUMS, 
http://www.ipmns.iunn.edu/) is comprised of twenty-five “high-precision” 
samples of the U.S. population from U.S. censuses since 1850. The data 
include economic and social variables. For additional information, see 
Maier (1999), Chapter 2. 

Current Population Survey 

The Current Population Survey (CPS, http://www.bls.gov/cps/home 
•htm) is a monthly survey of a sample of U.S, households designed to col¬ 
lect data on employment, unemployment, and individuals not in the labor 
force.- Its data are classified by a number of demographic characteristics, 
including age, gender, race, occupation, industry, and hours of full- or 
part-time employment. Some data are available on the work experience 
and educational background of workers. For additional information, see 
Maier (1999), Chapter 9. • 

Current Employment Statistics 

Current Employment Statistics (CES, http://www.bls.gov/ces/home 
.htm) is a monthly survey of payroll records of a large sample (more than 
500,000) of U.S. businesses. It collects data similar to that of the CPS, but 
the source is employers rather than households. Specifically, the CES col¬ 
lects detailed data on employment (i.e., number of employees), weekly 
work hours, and hourly and weekly earnings of nonfarm workers by in¬ 
dustry. The data are organized in two ways: Ownership (private versus pub¬ 
lic employer, and if the latter, then federal, state, or local government) and 
Industry (using the North American Industry Classification System 
(NAICS). Data are available at the national level, as well as by state and 
metropolitan area. 4 National data for employment are available back to 
1919. National data for work hours are available back to 1947; national 
data for hourly earnings are available back to 1962. Data for states and 
metropolitan areas are available back to the 1980s. 
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The Economic Census 

The Economic Census (http://wnv.census.gov/epcd/www/econ97 
.html) provides a detailed snapshot of the production side of the U.S. econ¬ 
omy. It covers all tire production sectors of die economy, except for agricul¬ 
ture and government (which are addressed in separate surveys). These 
include: mining; utilities; construction; manufacturing; wholesale and retail 
made; transportation and warehousing; information; finance; insurance; real 
estate; rental and leasing; professional, scientific, and technical sendees; man¬ 
agement; educational sendees; healtii care; entertainment; food sendees; and 
others. Beginning with die 1997 Economic Census, data are organized under 
die NAICS industry classification system rather tiian die previous Standard 
Industrial Classification (SIC) system. The Economic Census is a survey of 
“establishments,” which are defined as individual business units at a single lo¬ 
cation. Thus, a single company (e.g., Pizza Hut) may include multiple estab¬ 
lishments. Key data collected include die number of establishments, number 
of employees, payroll, measures of sales or output, costs, and assets. Data are 
available (in decreasing detail) at die national, state, metropolitan area, 
county and zip code levels, and also by specific industry. 

Annual Survey of Manufactures 

The Annual Survey of Manufactures (ASM) is a survey of a sample of 
manufacturing establishments organized according to the NAICS industry 
classification system. The survey can be found at http://www.census.gov/ 
econ/overview/ma0300.html. The survey includes data on value of ship¬ 
ments, value added, employment, labor costs, materials costs, capital ex¬ 
penditures, energy consumption, and inventories. 

Current Industrial Reports 

The Current Industrial Reports (CIR, http://www.census.gov/cir/ 
www/index.html) provide annual, quarterly, and monthly data on pro¬ 
duction and shipments of manufactured goods in selected industries, 
including aerospace equipment, chemicals, computer and electronic 
components, consumer goods, industrial equipment, primary metals, tex¬ 
tiles, and apparel. The most disaggregated data are available at the annual 
level; the least at the monthly level. 

American Housing Survey 

The American Housing Survey (AHS, http://www.census.gov/hhes/ 
www/ahs.html) is a panel survey conducted biannually at the national 
level and every four years for major metropolitan areas. The national 
panel includes approximately fifty-five thousand housing units, including 
single-family homes, apartments, and mobile homes. The survey also in- 
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eludes data on housing and neighborhood characteristics, housing costs, 
and income of tire residents. 

Consumer Expenditure Survey 

The Consumer Expenditure Survey (CES, http://wvrv.bls.gov/cex/ 
home.htm) is a survey of the spending behavior of a sample of American 
households. The data are derived from a quarterly interview and a weekly 
diary that respondents fill out. The result is annual data on income and ex¬ 
penditures disaggregated by age, household size, and other demographic 
characteristics. Expenditures are broken down by major classes, including 
food, housing, apparel and services, transportation, health care, enter¬ 
tainment, personal care products and services, reading, education, to¬ 
bacco products and supplies, and personal insurance and pensions. The 
data are available back to 1980 for some series, and 1984 for the rest. For 
a similar data set, see Tables 2.6 and 2.7 of the NIPA. 

National Longitudinal Surveys 

The National Longitudinal Surveys (NLS, http://wwv.bls.gov/nls/home 
•htm) are a collection of surveys, conducted by the Bureau of Labor Sta¬ 
tistics over the past thirty years, to develop data on the labor market activ¬ 
ities, broadly defined, of several representative groups of Americans. 
These are probably the longest running and certainly the best-known 
panel data sets in the United States. The original surveys consisted of four 
cohorts: Young Men, Young Women, Mature Women, and Older Men. All 
cohorts were constructed to be nationally representative samples of their 
respective groups. The surveys include data on education, training, mar¬ 
riage, fertility, health, income, and assets. 

The Young Women’s survey includes women who were ages fourteen 
through twenty-four when first interviewed in 1968. The Mature Women’s 
survey includes women who were ages thirty through forty-four when first 
interviewed in 1967. These surveys are now conducted simultaneously in 
odd-numbered years. The Young Men’s survey, which was discontinued in 
1981, includes men who were ages fourteen through twenty-four when 
first interviewed in 1966. The Older Men’s survey, which was discontinued 
in 1990, includes men who were ages forty-five through fifty-nine when 
first interviewed in 1966. 

There have been three subsequent cohorts: ‘Youth 1979,” “Children of 
Youth 1979 - Supplement to NLSY79,” and ‘Youth 1997.” The NLSY79 is a 
survey of men and women born in die years 1957-64; respondents were 
ages fourteen through twenty-two when first interviewed in 1979. The 
NLSY79 “Children and Young Adults” is a survey of die children of the 
women in die NLSY79. The NLSY97 is a survey of young men and women 
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bom in the years 1980-84; respondents were ages twelve through seven¬ 
teen when first interviewed in 1997. 

The NLS79 includes data on wages, work hours, industry, and occupa¬ 
tion. It also includes some information on spouse employment; highest 
level of education completed (by grade); high school GPA and DOD apti¬ 
tude test (ASVAB) scores; job training; family relationships (marriage, 
number of children, etc.). The “Children ofYouth 1979” includes data on 
cognitive, socio-emotional, and physiological assessments; household in¬ 
come; net worth; and health status. For a similar data set, see die Panel 
Study of Income Dynamics (PSID). 

Panel Study of Income Dynamics 

The Panel Study of Income Dynamics (PSID, http://www.isr.umich.edu/ 
src/psid/) is a longitudinal study of a representative sample of U.S. fami¬ 
lies. The study includes data on the families as well as the individual men, 
women, and children in them. Begun in 1968, the data were collected an¬ 
nually through 1997, and biennially starting in 1999. The sample size be¬ 
gan as forty-eight hundred families. By '2001, it had grown to over seven 
thousand families. Though similar to the NLS in terms of content, the 
PSID focuses more explicitly on families. 

The PSID includes data on income sources and amounts; poverty sta¬ 
tus; public assistance in the form of food or housing; other financial 
matters (e.g., taxes, inter-household transfers); family structure and 
demographic measures (e.g., marital events, birth and adoptions, children 
forming households); labor market work (e.g., employment status, work/ 
unemployment/vacation/sick time, occupation, industry, work experi¬ 
ence); housework time; housing (e.g., own/rent, house value/rent pay¬ 
ment, size); geographic mobility (e.g., when and why moved; where family 
head grew up; all states head has lived in); socioeconomic background 
(e.g., education, ethnicity, religion, military service; parents’ education, 
occupation, poverty status); and health (e.g., general health status, dis¬ 
ability) . 

Beginning in 1997, the PSID includes a Child Development Supple¬ 
ment. This survey uses a subsample of the PSID consisting of 2,394 house¬ 
holds and 3,563 children up to twelve years old. Its purpose is to collect 
data, every five years, on a variety of issues impacting the cognitive and 
emotional development of children. 

Surveys of Consumers 

The Survey Research Center (http://www.sca.isr.umich.edu) attire Uni¬ 
versity of Michigan publishes a number of indices that measure household 
expectations. The most prominent of these is tire Index of Consumer Senti¬ 



ment. These data are accessible through tire “guest” login at the Survey Re¬ 
search Center. They are also available from tire FRED II database at tire St. 
Louis Fed. 

The Conference Board (http://ww.conference-board.org) publishes 
the monthly Consumer Confidence Index. The most recent monthly data are 
available free from their webpage, but past data require a subscription. 

Survey of Consumer Finances 

The Survey of Consumer Finances (SCF, http://ww.federalreserve 
.gov/pubs/oss/oss2/scfindex.html) is a survey conducted by the Federal 
Reserve Board of Governors of selected demographic characteristics of 
U.S. families, including their income, balance sheets, and use of financial 
services. The survey has been triennial since 1983. For similar- data on the 
U.S. economy as a whole, see the U.S. Flow of Funds Accounts. 

Major Secondary Data Collections 

In the previous section we identified the major producers of U.S. eco¬ 
nomic and social data. You can certainly obtain data from those sources, 
especially if you are willing to dig for it. However, it is often easier to go to 
a secondary source of data. Secondary sources are usually more user 
friendly than primary sources, which tend to be designed for expert users. 
Additionally, just as it may be helpful to begin a literature search with sec¬ 
ondary sources, the same is true for data searches, especially if you are un¬ 
certain about what data are available on your topic or where those data 
may be found. What follows is not meant to be an exhaustive list but rather 
a sample of what is available. It also includes a sample of the major sec¬ 
ondary sources of international economic and social data. 

The Economic Report of the President 

The Economic Report of the President (ERP, http://w3.access.gpo 
.gov/eop/) provides easy access to nearly all the macroeconomic data un¬ 
dergraduates need for their courses (though not for their research pa¬ 
pers). This is a good first place to look for macro data. 

Economagic 

Economagic (www.economagic.com) is an excellent source of U.S. macro 
and regional as well as some foreign data. The advantages of Economagic 
are, first, that it is a very large site, covering data for more than one hun¬ 
dred thousand variables, and second, that it provides a single, easy-to-use 
interface for accessing all that data, which is produced by a variety of gov¬ 
ernment agencies. The data are available in multiple formats from spread¬ 
sheets to .PDF files. The site also allows users to do some data manipulation 
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before downloading. For example, if die data available focus on GDP but 
you need growth rates, Economagic will compute them for you. More ad¬ 
vanced features are available by subscription. 

FRED II (Federal Reserve Economic Data) 

FRED II (http://reseai-ch.stlouisfed.org/fred2/) is an excellent source 
for U.S. macroeconomic and financial data. It also provides regional data 
for die region of the counuy serviced by die St. Louis Federal Reserve 
Bank. The data are easy to download. 

STAT-USA/State of the Nation 

STAT-USA (http://www.stat-usa.gov/) is a U.S. federal government- 
sponsored data site. It provides a single interface for obtaining U.S. macro- 
economic, financial, and industry data. The primary data are obtained 
from the U.S. Bureau of Economic Analysis, die U.S. Bureau of Labor 
Statistics, and the Federal Reserve. The data are available by subscription 
or for free through federal depository libraries, which includes many uni¬ 
versities. STAT-USA is not as user friendly as Economagic or FRED II. The 
main advantage of this site is that it includes some specialized data that are 
unavailable from diose other sources. 

Inter-university Consortium for Political and Social Research 

Inter-university Consortium for Political and Social Research (ICPSR, 
http://www.icpsr.umich.edu/) is a major archive of data for social science 
research. It covers data on a broader range of topics, for example, survey 
and panel data, than the other (i.e., economics) sources discussed here. It 
is a good source of data for microeconomic research. ICPSR is a subscrip¬ 
tion service, but it is available to researchers at many universities. 

International Financial Statistics 

International Financial Statistics (IFS) is the principal data set of the In¬ 
ternational Monetary Fund (IMF). It is an excellent source of macroeco¬ 
nomic, international, and domestic financial data for IMF member 
countries. Data include exchange rates, international liquidity, interna¬ 
tional banking, money supplies, interest rates, price levels, international 
trade, government accounts, and GDP and its major components. Data are 
available in monthly, quarterly, and annual frequencies. Not all data series 
are available for all countries or for all frequencies. The main negative fea¬ 
ture (from an undergraduate’s point of view) is access. IFS is widely avail¬ 
able in hard copy, but it is not available online. It is possible to purchase 
the data on a single-user CD-ROM, but the price ($350 for the academic 
rate) makes it prohibitive for many colleges. For annual data, the IFS Year¬ 
book, even in hard copy, can provide a quick fix. One additional caveat is 
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that since tire data are supplied from member-country governments, you 
should not assume that they are directly comparable. For such compar¬ 
isons, a better data source might be tire Penn World Tables (discussed later 
in this section). 

World Economic Outlook Database 

The World Economic Outlook is a biannual (May and October) sur¬ 
vey of world economic conditions published by the IMF. The database 
(http://www.imf.org/search97cgi/s97is_eng.dll/search97cgi/metsrcheng. 
ini?action=FilterSearch&filter=spquery.hts&OueryText=weodb) is de¬ 
signed to complement the survey. It prorides annual data (back to 1970) 
on roughly fifteen macro series, for IMF member countries. These in¬ 
clude GDP (various measures), inflation, government budget balance 
(actual and structural), output gaps, net capital flows, and external debt 
and debt sendee. There are also half a dozen “world” macro aggregates, 
including trade volume. 

Penn World Tables 

The Penn World Tables (http://datacentre.chass.utoronto.ca/pwt/index 
.html) proride annual data for more than 150 countries on twenty-nine 
macro and related aggregates. The strength of this data set is that it offers 
the ability to make meaningful international comparisons between each 
country’s data. The range of data available is 1950 through 1992. Data are 
not available, however, for all countries over the full range. 

Joint BIS-IMF-OECD-World Bank Statistics on External Debt 

The Joint BIS-IMF-OECD-World Bank Statistics on External Debt database 
(http://www.oecd.org/statistics/jointdebt) provides definitive data on na¬ 
tions’ external debt. It includes data on debt balances and changes in debt. 

Eurostat 

Eurostat (http://epp.eurostat.cec.eu.int/portal/page?_pageid= 1090, 
1137397&_dad=portal&_schema=PORTAL) is the official statistical 
agency for the European Union. As such, it provides a large amount of 
high-quality data on a wide array of subjects, including national accounts, 
foreign trade, and finance. Current data are available for free, but histor¬ 
ical time series are fee-based. 

OECD Main Economic Indicators and National Accounts 

The Organization for Economic Cooperation and Development 
(OECD, http://www.oecd.org/statsportal/0,2639,en_2825_293564_l_l_ 
l_l_L00.html) compiles a number of excellent databases, including Main 
Economic Indicators and National Accounts of OECD Countxies. These 
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cover the major macroeconomic and financial data for the thirty member 

countries. The major negative feature of these data sets is accessibility. 

Though they are widely available in hard copy, tire online versions are fee- 

based. 

A final note: If you don’t find what you need in a secondary source, do 

consider the primary sources! 

SUMMARY 

o Data are constructed rather than collected. 

• Most social science statistics are based on sample data. 

® Survey data come in three forms: time series, cross-section, and longitu¬ 
dinal. 

® The majority of U.S. data are produced by tire Census Bureau, tire Bu¬ 
reau of Economic Analysis, fire Bureau of Labor Statistics, and the Fed¬ 
eral Reserve System. 

o Secondary data sources are often easier to use than primary data sources. 

• An overview of primary and secondary data sources is given in Appen¬ 
dix 8A. 

NOTES 

1. Strictly speaking, a panel is more than that; in other words, it is possible 
to create a “pool” of cross-sections, which is not a panel per se. 

2. When we talk about micro or macro data sets, we are describing the 
source of the observations, not the type of analysis the data are used for. 
One can test a macro hypothesis (e.g., a consumption function) with 
micro data, and vice versa (e.g., a demand function for an aggregate 
consumption good or service, using the decomposition of quantity and 
price data for personal consumption expenditures). 

3. For nations other than the United States, there are comparable gov¬ 
ernment statistical agencies. See, for example, the country entries un¬ 
der “World and Non-U.S. Data” at Resources for Economists on the Internet 
(http://www.rfe.org/Data/World/mdex.html). A similar listing is 
available at (http://labour.ceps.lu/statisticsframe.cfm). 

4. Data are also available for Washington, D.C.; Puerto Rico; and the U.S. 
Virgin Islands. 


•A 


Exercises 


SUGGESTIONS FOR FURTHER READING 

Best (2001 )—-Very thoughtful monograph on how data series are con¬ 
structed, and the ways they can distort what they purport to measure. 

Brown et al. (1996 )—Very readable introduction to tire Panel Study of Income 
Dynamics. Well worth reviewing if you are considering using tire PSID. 

Clayton and Giesbrecht (2001 )—Thoughtful handbook on how tire major 
macroeconomic statistics are computed, what they measure, and how to 
correctly use them. Discusses statistics on output, production and 
growth; investment and capital expenditures; employment, earnings, 
and profits; spending, sales, and expectations; tire price level, money, 
and interest rates; and financial markets, international trade, and for¬ 
eign exchange rates. 

Huff (1954 )—Classic monograph on tire potential problems with com¬ 
monly reported statistics. Very readable and intuitive. 

Maier (1999 )—Excellent guide to tire major social science statistics. In¬ 
cludes information on how to obtain the data online, as well as com¬ 
mentary on how the data are obtained and on their reliability. 

Morgenstem (1979 )—Classic critique of tire use of macroeconomic statis¬ 
tics. Anyone using macro data should be familiar with this. Reprinted 
from Morgenstem’s On the Accuracy of Economic Observations, 1963. 

Pergamit et al. (2001 )—Helpful introduction to the National Longitudinal 
Surveys. Worth checking out if you are considering using the NLS. 

Popkin (2000 )—Decent introduction to the National Income and Product 
Accounts. Worth reading if you are considering using the NIPA. 

Teplin (2001 )—Very good introduction to the Flow of Funds Accounts. 
Well worth reading if you are considering using the Flow of Funds. 


EXERCISES 

1. Go to the FRED II database at the St. Louis Federal Reserve Bank on¬ 
line at http://research.stlouisfed.org/fred2/. Download tire monthly 
unemployment rates for the past five years for the states in the St. Louis 
Federal Reserve region. Record the complete URL for each data series. 
Compare and contrast the state unemployment rates. 

2. Go to Economagic.com. Collect data for 2000 per capita disposable per¬ 
sonal income for your home state and all the contiguous states. Rank 
them from highest to lowest. Report the complete URLs. 



3. Find the primary source for data gross state product. Collect die data 
for 2002 for any ten states in die United States. Report the complete 
URLs. 

4. Suppose you need annual data for GDP for Germany, France, and die 
United Kingdom. What source would you go to and why would that be 
die best? 

5. Suppose you are asked to obtain annual data for an aggregate produc¬ 
tion function for die United States: Q = f[ L, K]. Track down sources 
for diese three variables and report the complete URLs. Explain why 
you chose each data source. 


Overview of Data Sources 


Places to Start 

Resources for Economists on the Internet (http://rfe.org) 

WebEc (http://netec.wusd.edu/WebEc) 

U.S. Macroeconomic Data 

National Income and Product Accounts—Economagic, FRED, STAT-USA 
U.S. Flow of Funds Accounts (macro/financial data) 

Other Nations' Macroeconomic Data 

National Statistical Agencies for Selected Countries (http://rfe.org/data/ 
world/index.htm) 

EC Member Countries—EuroStat 

OECD Member Countries—OECD Main Economic Indicators and OECD 
National Accounts 

IMF Member Countries—IMF International Financial Statistics 

Other Nations—World Economic Outlook, Penn World Tables, U.N. 
System of Accounts 

U.S. Labor Market Statistics 

Current Population Survey 

Current Employment Statistics 
Economagic 

U.S. Microeconomic Data 

Agricultural products—U.S. Department of Agriculture 
Production of Manufactured Goods—Survey/Census of Manufactures 

Demand for Consumer Goods and Sendees—Survey of Personal Con¬ 
sumption Expenditures (NIPA Table 2.4—PCE by Type of Expendi¬ 
ture) , or Consumer Expenditure Survey 

Demand for individual companies’ products—Hoover’s Online and Edgar 

U.S. Industry Data 

The Economic Census 
Annual Survey of Manufactures 
Current Industrial Reports 


U.S. Micro Data Sets 

National Longitudinal Surveys 

Panel Study of Income Dynamics 
IPUMS 

International/Trade Data 

U.S. Balance of Payments Accounts 

U.S. imports and exports—Bureau of Economic Analysis, Customs Bureau 
IMF International Financial Statistics. 

Eurostat 

International Investment Position of the U.S. 

World Bank Statistics on External Debt 


chapter 


Putting Together Your 
Data Set 

“Economists. . . may have a tendency to 
place more faith in the accuracy of data 
than is sometimes warranted. ... We 
should be diligent to remind ourselves 
that the mere existence of data in 
numerical form does not, in itself, make 
it accurate or error-free." 

DOW ETHERIDGE 



n Chapter 8 we observed that data collection is a major part of any 
Uj empirical research project. Empirical research can be divided into 
|| two types: experimental and survey (or nonexperimental). In ex- 
[v.:...experimental research, the data come from performing tire experi¬ 
ment. Thus, in experimental research collecting the data is, by de fin ition, 
a significant part of tire project. Unfortunately, in survey research where 
one uses preexisting data, data collection is not always given tire same re¬ 
spect; researchers often don’t put the same care and effort into it. This is 
a serious mistake! If you drink I’m wrong here, ask yourself tire following: 
If you use tire most advanced statistical techniques but your data are 
flawed, what land of inferences can you draw from tire results? 

In this chapter we will explain what you need to do at a minimum to con¬ 
struct a satisfactory data set. We will discuss how to develop a search strategy 
for finding appropriate data with a minimum of time and effort, using both 
primary and secondary data sources. We will also discuss how to convert the 
data into a useable form and to construct a data appendix for your research. 







Developing a Search Strategy for Finding Your Data 

When you search for data, just as when you do your literature survey, it’s 
a good idea to start widr a search strategy. Time spent up front in this ac¬ 
tivity will almost always pay off by significantly reducing die effort required 
to find and collect die data you need. The process for developing a search 
strategy is oudined in Figure 9.1. 

Step 1: Before You Search 

At least diree issues are involved in die construction of a good data set. 
The first issue is sample size—you need to have enough data points to de¬ 
rive statistically valid empirical test results. You typically need tiiirty data 
observations to ensure adequate degrees of freedom, diough you can 
sometimes make do witii less. The second issue is that of a random or rep¬ 
resentative sample. We touched on diese two issues in die previous chap¬ 
ter, but we will discuss diem in detail in Chapter 10. 

The third issue is obtaining data that correcdy measure the concepts 
diat your theory deems important. Begin by retiewing your conceptual or 
theoretical model. What variables do you need data for to test your re¬ 
search hypodiesis? If you could obtain die ideal data, what variables would 
diey include? At the highest level of abstraction, these questions should be 
straightforward to answer. If not, you may need to rethink your research 
hypodiesis to obtain more focus. 

Next, come a series of practical questions: What are the different forms 
in which those data might be available? There are several aspects to tiiis 

i Figure 9.1 Developing a Search Strategy for Finding Your Data 

Step 1: Before You Start Searching 

• What are the desired variables? 

° How should each variable be defined? 

° What data frequency and sample period or what level of analysis? 

• What are potential sources for data on each variable? 

Step 2: As You Search 

• What data are available? 

• Are there suitable proxy variables for variables that are unavailable? 

• If not, how can the empirical model be modified to use the data avail¬ 
able but still test the hypothesis? 
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query. The first is conceptual. For example, if your tiieory requires a mea¬ 
sure of income, you might find two types: per capita income and family in¬ 
come. Which is die best measure to test your hypodiesis ideally? 

The second aspect to diis question is whedier you should seek time-series 
or cross-section data. If die former, what frequency of data should you look for: 
annual , quarterly, monthly, weekly, or daily, and over what sample period? If 
die latter, what unit of analysis is most appropriate for your hypodiesis: average 
income at die state level or die comity-level, or even individual family incomes? 

Since diere may be a variety of ways to measure the variable you are in¬ 
terested in, you should think carefully about what your variable would be 
ideally, and dien what you can live with. 

Once you have determined your list of desired variables, die next step 
is to think about where those data are likely to be found—for example, by 
using die sources discussed in Chapter 8. What agency collects that data? 
What data sources were used by previous studies on your topic? 

A good place to start your data search is die website. Resources for 
Economists on die Internet (RFE, http://rfe.org). RFE is an extensive 
source of a variety of information useful for economists. You may recall 
that we introduced it in Chapter 3 as part of our discussion on surveying 
the literature. RFE protides an extensive and annotated list of data sites 
(including a number of more specialized sources omitted from our dis¬ 
cussion in Chapter 8). The annotations are particularly useful since diey 
help you determine if a data site will be useful for your research, without 
your hating to dig into it yourself. RFE’s data collection is divided into five 
sections: U.S. macro and regional data, otiier U.S. data (e.g., data for mi¬ 
croeconomic research), world and non-U.S. data, data from financial mar¬ 
kets, and data from published journal articles. 

Another useful resource for beginning a data search is the website, World 
Wide Web Resources in Economics or WebEc (http://netec.wustl.edu/ 
WebEc). This site protides information similar to that of RFE, but with 
more of an international focus. 

Once you have reviewed RFE and/or WebEc, you should have some 
good ideas about where to look for die data you need. These are likely to 
include both primary and secondary data sources. You may wish to review 
Appendix 8A, which outlined die primary and secondary data sources de¬ 
scribed in Chapter 8. 

Step 2: As You Search 

If you have completed Step 1, you now have a list of ideal variables for test¬ 
ing your hypothesis. You have also identified several promising data sources, 
such as Economagic and FRED II. The next step is to begin die search. 

flSl 
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As you investigate each data source, you need to ask several questions: 

1. What data are, in fact, available? Remember that this question in¬ 
volves both correctly defined variables and a sufficient number of 
data points. 

2. If the data are not the ideal, are they good enough? In practice, it is 
uncommon for the ideal data to exist with a sufficient number of ob¬ 
servations. After all, tire folks who produced tire data probably didn’t 
have your exact research in mind. You might find the exact variable 
you desire but too few data points. Or you might find sufficient data 
points but for a variable that is not defined exactly as you’d like. 

3. If the data are not acceptable, is there an available proxy that is? A 
proxy is a variable that should behave roughly the same as your the¬ 
oretical variable. For example, it should have the same trends and 
turning points as the theoretical variable, though it isn’t conceptu¬ 
ally identical. 

4. If there is no adequate proxy, how can the hypothesis be reformu¬ 
lated to make it testable, given the data available? 

Let’s illustrate these issues with a series of examples. Suppose you would 
like to test the Keynesian consumption function. The theory hypothesizes 
that consumer spending in tire aggregate is a linear function of national 
income or GDR 

C = a + b * GDP 

If you wanted to test this theory for the U.S. economy using annual or 
quarterly data, you would have no difficulty obtaining a large number of 
observations for both C and GDR Macro data for the U.S. economy are 
among the best data in the world; they are available in great detail over 
long time frames. 

But suppose you wanted to apply this theory to a foreign country. Such 
data are often more difficult to obtain, if they are obtainable at all. One 
complication, especially when doing international comparisons, is that 
concepts may be measured differently in different countries. For example, 
the unemployment rate in Japan is not defined in the same way as in U.S. 
data. Additionally, in many countries published macroeconomic data are 
thought to be politically manipulated. Thus, the published data, even if 
available, may not correctly measure the underlying theoretical concepts. 

Consider another example closer to home. Suppose you wished to test 
the Keynesian consumption function over a monthly time frame. Data on 
personal consumption expenditure and GDP are available in annual or 




quarterly frequency, but not monthly. Thus, you would need to find proxy 
variables in order to do a monthly study. Recall what makes a good proxy. 
Either the proxy should measure the same underlying economic behavior 
as tire theoretical variable you wish to study, or it should be highly corre¬ 
lated in a statistical sense with tire theoretical variable. For example, retail 
sales are published monthly, and they measure approximately the same 
tiring as consumption expenditure. Finding an adequate proxy for GDP is 
more problematic. One candidate might be the Fed’s index of industrial 
production, which is available monthly. Industry production measures the 
manufacturing output of the economy. In tire early twentieth century, 
when the U.S. economy was largely industrial, this would have made a 
good proxy for GDP. In recent years, however, manufacturing has declined 
to less than 40 percent of GDP, while tire service sector is roughly 60 per¬ 
cent. To see if industrial production is an acceptable statistical proxy, one 
should examine tire correlation between tire two variables over the closest 
frequency available, in this case, quarterly. Examining data for quarterly 
real GDP and quarterly industrial production indicates a very high corre¬ 
lation of 0.96. Thus, industrial production probably is an acceptable 
proxy. 1 

These problems are often more acute when you are doing a microeco¬ 
nomic study. For these, you trill more likely need to use variables from dif¬ 
ferent sources, and perhaps different definitions. It is also more likely that 
different variables may come from different populations. Suppose your 
theory identifies weekly work hours by couples with children between the 
ages of six and seventeen. The available data may include weekly work 
hours by gender nationwide as well as employment rates among males and 
females in couples with children. You can construct a data series that ap¬ 
proximates the theoretical concept, by making suitable assumptions. For 
this example, you could assume that males and females with children have, 
if employed, the same weekly work hours as males and females in tire gen¬ 
eral population. You still need to be concerned about how well tire result¬ 
ing series serves as a proxy for the desired theoretical variable. Ultimately, 
to perform tire analysis at all, you may have to hope for the best and use 
tire data, but acknowledge its potential weaknesses. 

Oftentimes, researchers must simplify their theoretical model because 
they lack adequate data on tire variables they need. Either there are insuf¬ 
ficient data points for one or more variables, or tire specific variable they 
need is not available. In our example of tire monthly consumption func¬ 
tion, if a proxy for monthly GDP cannot be found, you may need to change 
the study to examine a quarterly consumption function. 





[(Figure 9.2 Different Forms of Data 

• Levels 

• Per capita 

• Changes 

° Rates of Change (or Growth Rates) 

• Annualized Growth Rates 

• Proportions 
° Nominal 

0 Real 

• Index Numbers 


Data Manipulation 

Data for any variable are available in a number of different forms. The 
most important of these are listed in Figure 9.2. In general, it is possible to 
convert a variable from one form into another. In this section we summa¬ 
rize some commonly used techniques for performing those conversions. 
Even if you can find data in die exact form you need so you don’t need to 
peifoim diese calculations, it is nonetheless important diat you under¬ 
stand what each form means and how diey relate to each odier. 

Level of Variable 

The most basic form of any variable is called die level. The level is the 
actual value or size of die variable being measured. For example, Figure 9 3 
shows that the level of U.S. GDP in 2001 was $10,082 billion. 

Often, researchers prefer to use the per capita form of a variable. Per 
capita means per person. It is found by dividing the level of the variable 
by the appropriate population. For example, since the U.S. population 
in 2001 was 285.318 million, U.S. GDP per capita in 2001 was $10,082 

f Figure 9.3 > Levels, Changes, and Growth Rates of U.S. GDP (in billions of $) 


Year 

1999 

GDP 

$9,274.3 

Change in GDP 

Percentage Change 

2000 

$9,824.6 

$550.3 

5.9% 

2001 

$10,082.2 

$257.6 

2.6% 

2002 

$10,446.2 

$364.0 

3.6% 


Pay Attention to the Units! 

Novice researchers often pay little or no attention to the units in 
which variables are measured. This is a mistake you should avoid! In 
our previous example of GDP per capita, GDP is measured in billions 
of dollars, while the population is measured in millions of people. 
Dividing the two yields units of thousands of dollars per person. Note 
that it’s not enough to know that GDP is measured in dollars—you 
should know whether it’s trillions of dollars or milli ons of dollars. 
Suppose you ignored those zeros in GDP and population, and only 
divided the digits: 10,082 /285.318. You would have obtained an 
answer of $35.37 per capita. So if you think that the number of zeroes 
in statistics doesn't matter, ask yourself whether you’d prefer to have 
an annual income of $35,000 or $35. 


billion/285.318 million = $10,082,000 million/285.318 million = $35,336 
per person. 

One common use of the per capita form of GDP is to compare stan¬ 
dards of living between nations. Mexico and Spain have approximately the 
same size economy: Mexico’s 2002 GDP was about $900 billion, while 
Spain’s was $828 billion, so Mexico’s was slightiy larger. However, Mexico’s 
population is substantially larger than Spain’s, so Spain’s standard of living 
is much higher. Mexico.s 2002 GDP per capita is about $9,000 per person; 
by contrast, Spain’s 2002 GDP per capita is more than double that, at about 
$20,700 per person. For similar reasons, when you are performing odier 
types of comparisons, you might use the per family or per household form 
of income or some other variable. 

Change in Variable 

For some purposes, researchers find it more useful to examine the 
change in a variable than the level. For example, knowing that the level of 
U.S. GDP in 2001 was $10,082 billion may be less important than knowing 
diat it increased by $258 billion between 2000 and 2001. In other words, 
the change in a variable is just the difference in die levels from one time 
period to anodier. 

Often, it is important to know whether a change was meaningful. Was die 
increase in GDP between 2000 and 2001 large or small? One way to assess 
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Caution on Internationa! Comparisons 

Each country measures its economic data, such as GDP, in its own 
national currency. In other words, Mexico's GDP is measured in pesos, 
while Spain’s GDP is measured in pesetas. In order to make cross¬ 
country comparisons, as we did earlier, the data need to be converted 
into a common unit of account, a common denominator. Either both 
GDPs need to be in pesos, both in pesetas, or both in some third 
currency, like dollars. This conversion can be done in two ways. The 
most common way is to use an exchange rate, that is, the price of one 
currency expressed in terms of another. Thus, one could take Spain’s 
GDP (in pesetas) and multiply by the U.S.$-peseta exchange rate to 
obtain Spain’s GDP in terms of dollars. Going through the same 
process for Mexico, we would multiply its GDP in pesos by the U.S.$- 
peso exchange rate to yield Mexican GDP in terms of dollars. Then the 
two GDPs could be compared. 

In practice, however, this is not the best way to convert Mexican 
and Spanish GDP. This is because exchange rates fluctuate regularly, 
sometimes dramatically in a short period of time, which makes it 
appear that a country’s GDP has increased or decreased when really it 
has not. For many purposes, then, such as comparing two nations' 
standards of living, it is preferable to use purchasing power parity 
equivalents (or PPP Equivalent) instead of exchange rates to convert 
the GDP figures. Indeed, the figures we just cited were computed with 
this method. The PPP Equivalent is a measure of the relative 
purchasing power of each country's currency. 2 Essentially, it extracts 
the volatility from exchange rates to yield a more accurate, long-term 
measure of the value of a currency in terms of another. Whenever 
possible, you should use the purchasing power equivalent version of 
economic data when comparing across countries. This is usually 
labeled "purchasing power parity $." 

Note also that direct comparisons of GDP or income per capita can 
be misleading when you are making comparisons between economies 
that have different economic systems or levels of economic 
development. For example, the reported income per capita of rural 
residents in a country tends to undervalue their actual standard of 
living, since they often have unreported income-in-kind such as 



Data Manipulation 


produce they grow for their personal consumption. In other words, 
comparing reported incomes per capita between urban and rural 
residents would be biased in favor of the former group. The same 
problem occurs when comparing incomes per capita between 
developed and developing nations. 


this is to look at the rate of change, also called tire percentage change or tire 
growth rate. The rate of change is the change as a percentage of tire origi¬ 
nal level. To compute tire rate of change for some variable X between an ini¬ 
tial period 0 and a subsequent period 1, you can use the following formula: 
G 1|0 = {[Xj/Xo] - 1} * 100 

where X 0 is tire level (or value) of X in the initial period, Xj is (he level of 
X in tire subsequent period, and G li0 is tire rate of change or growth rate 
between periods 0 and 1. 

For example, the growth rate of GDP from 2000 to 2001 was 
{[$10,082.2 billion/$9,824.6 billion] - 1} * 100 
= {[1.0262] - 1} * 100 
- {.0262} * 100 
= 2.62% 

Growth rates are especially useful for comparing concepts that are dif¬ 
ferent orders of magnitude, like tire annual sales of Microsoft versus the 
annual sales of Joe’s Computer Software, Inc. 

By convention, researchers usually compare data in terms of annual 
growth rates, even if the period of time they are examining is more or less 
than a year. The reason for this is so they have a common standard for com¬ 
parison. Growth of GDP over a decade is almost certainly greater than 
growth over a shorter period, but it isn’t clear that GDP is growing at a 
faster or slower rate over the decade until you standardize the two time pe¬ 
riods examined. For periods longer than a year, tire overall growth rate is 
simply divided by the number of years involved. Thus, if GDP grew 20 per¬ 
cent over ten years, the average annual growth rate would be 20/10 = 2% 
per year. 

For periods of time shorter than a year, like a month or a quarter, re¬ 
seal chers usually follow a different procedure. They ask the question, “If 
growth was X% for some period less than a year, what would the cumula¬ 
tive rate be if that growth rate continued for an entire year?” This is called 
an annualized growth rate. You might drink that you could just multiply 
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[figure 9.4 Annualized Growth Rates 


Beginning: 

Sales: 

$100 


Quarter 

I 

= 1 . 10*100 


Quarter 

II 

= 1 . 10*110 


Quarter 

III 

= 1 . 10*121 
$133.1 


Quarter 

IV 

= 1.10*133.1 

$146.4 

46.4% 


quarterly data by four or monthly data by twelve to get annual rates. That 
would m fact provide a rough approximation, but only a rough one because 
it ignores the effects of compounding. (You may recall the way that interest 
on a bank account compounds. Not only do you earn interest on your de¬ 
posit, but also you earn interest on the interest. That’s compounding.) 

In the same way, suppose that a company’s sales were growing at 10 per¬ 
cent during the first quarter of the year. What would that amount to at an 
annual late, assuming that the 10 percent growth rate continued for three 
more quarters? Figure 9.4 illustrates this question. Suppose the level of sales 
was $100 at the beginning of the year. At the end of the first quarter, sales 
would be 10 peicent higher or $110. At the end of the second quarter, sales 
would be 10 percent higher than that, or $111. By the end of the fourth 
quarter, sales would be 46.4 percent higher than when the year started. In 
this case, since growth compounded over four quarters, the annualized 
growth rate (46.4%) is 6.4 percent more than the rough approximation 
(4 * 10% — 40%). If you were using monthly data instead of quarterly, the 
annualized growth rate would be even more than the approximation, since 
the compounding occurs twelve times. In other words, the more frequently 
the compounding is calculated, the greater the error in the approximation. 

To compute an annualized growth rate for quarterly data, use the fol¬ 
lowing formula: 

G q = {[Xj/X 0 ] 4 — 1} * 100 

where X 0 is the level of X in the initial quarter, X t is the level of X in the 
next quarter, and Gq is the annualized rate of change or growth rate. The 
formula for annualizing monthly data is the same except that instead of 
quarterly values of X you would use monthly values. In addition, instead of 
raising the ratio of quarterly values to tire fourth power, you would raise 
the ratio of monthly values to tire twelfth power: 

G q = {[Xj/Xq] 12 — 1) * 100 
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Figure 9.5 Proportions of 2002 U.S. GDP 
(Values for GDP and components are in billions of $) 
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C 
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69.9% 

15.3% 

18.9% 

-4.1% 


A form of data similar to a growth rate is a proportion. A proportion is 
also called a share or a percentage or a fraction. The idea behind propor¬ 
tions is simple: imagine dividing tire economic pie into several parts. Each 
part is a proportion. Thus, labor income (i.e., wages, salaries, etc.) is 
roughly 75 percent of national income. Similarly, capital income (i.e., in¬ 
terest, rentals, etc.) is roughly 25 percent of national income. A propor¬ 
tion is measured as a fraction or percentage of tire whole. Consider tire 
example in Figure 9.5. Airy student of macroeconomics knows that GDP 
can be divided into consumption expenditure, investment expenditure, 
government expenditure, and net export expenditure. Each of these frac¬ 
tions is a proportion. The proportion of GDP that is consumption expen¬ 
diture is computed as tire level of consumption expenditure divided by the 
level of GDP, or 69.9 percent. 

Proportions are useful for understanding a wide range of issues in eco¬ 
nomics. For example, Americans are drought to have a low saving rate. 
The saving rate is just tire proportion of income that is saved. Is the 
amount of poverty in America growing or shrinking? One way to assess that 
is by looking at the poverty' rate, which is the proportion of Americans with 
income below tire poverty line. 

Real versus Nominal Magnitudes 

A great deal of tire available economic data are based on a simple identity: 
V = P X Q 

where V = nominal (or value) 

P = price, and 
Q = real (orvolume). 

Nominal data are data measured by using the actual market prices that 
existed during tire time period in question. For example, tire value of a 
firm’s output in 1999 is equal to tire product of the price the firm charged 

fj 
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in 1999 and die quantity of output it produced in that year. Similarly, nom¬ 
inal GDP in 1999 is tire sum of the price times the quantity of ever)' prod¬ 
uct produced in a nation in 1999. Nominal data are easy to interpret. 

Real data are a bit more difficult to understand. At tire micro level, real 
data refer to tire actual quantities employed by a firm (e.g., labor hours), 
produced by a firm (e.g., number of widgets), or sold by a firm (i.e., sales 
volume). In our example, tire firm’s real output is the actual quantity of 
goods or sendees that it produced. 

The story becomes more complex, however, when we move above tire 
level of an individual firm to a more macro perspective. How does one 
measure real GDP, tire actual amounts of goods and services produced by 
a nation? One could create a list of all the products produced in every 
industry. But aside from being cumbersome, such a real measure of GDP 
would make it difficult or impossible to compare GDP from one year to tire 
next. If last year a nation produced 50,000 automobiles and 17,000 new 
homes, while this year tire figures were 52,000 and 14,000, did GDP rise or 
fall? The solution economists have found to deal with this quandary is to 
define real GDP as a weighted average of all tire goods and sendees pro¬ 
duced, where tire weights employed are tire prices of each product in a 
given year-. 3 In other words, money prorides tire common denominator by 
which to measure diverse goods and sendees produced in a year. 4 

Index Numbers 

Real GDP, as just defined, is air example of what economists call an index 
number, or more specifically, a quantity index, since it measures tire average 
quantity of something. The other type of index number is a price index. 
These are more familiar to noneconomists. For example, most people have 
heai'd of tire consumer price index. What is a price index? A price index is 
(roughly speaking) a weighted average of the prices of some type of good or 
sendee, where tire weights are the corresponding quantities in a given year. 3 
One could construct an energy price index (tire average price of different 
energy sources), a food price index (the average price of different foods), 
or a consumer price index (tire average price of goods and sendees con¬ 
sumers typically buy). An index number is a data series that begins at 100 
and typically increases in subsequent years, though not necessarily continu¬ 
ously. For example, Figure 9.6 gives the consumer price index for 1984-1993. 

How does one interpret an index number? Suppose you leam that in 
July 2003 tire consumer price index had a value of 183.9. What exactly does 
that mean? The short answer is that 183.9 doesn’t mean anything in isola¬ 
tion. Index numbers, unlike most other statistics, have no units. Rather, in¬ 
dex numbers are designed for comparison purposes. For example, one 


1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 

103.9 107.6 109.6 113.6 118.3 124.0 130.7 136.2 140.4 144.0 


could use an index number to compare tire level of whatever tire index is 
measuring (for example, consumer prices) to some earlier time period 
known as tire base period. Thus, a CPI of 183.9 means that on average con¬ 
sumer prices have risen 83.9 percent since tire early 1980s, which was tire 
base period for these figures. 

Index numbers are typically constructed by taking tire raw index, that 
is, the actual weighted average just mentioned, and transforming it using 
the following formula: 

X T = [X t /X„] * 100 

The variables in tire formula are defined as follows: 

X t is the value of the raw variable in a given time period t in tire series, 
X 0 is the value of the raw variable in tire base period, which is the pe¬ 
riod to be compared against, and 
X T is tire resulting index number. 

Note that in tire base period X t = X 0 . Therefore, an index number al¬ 
ways has a value of 100 in tire base period.- 

Index numbers can also be used to compare tire relative growth rates of 
two similar concepts, such as employment in Colorado and Virginia. They 
are particularly useful when tire concepts have very different magnitudes. 
Imagine trying to graph employment in your local town and employment 
in tire nation as a whole over the past ten years, both on the same graph. 
The national figures would be so much larger than the local ones as to 
make it veiy difficult to compare them. 

To make it easier to see what was going on, you could index both data 
series to a common starting point, say, ten years ago. What you would be 
doing is transforming each series to have an arbitrary value of 100 in that 
year, using the formula for computing index numbers given earlier. Then 
since each would have a common starting point, you could easily see how 
employment growth in your town compared to the national economy. 

Quantity Indices Versus Real Quantities 

There are two ways to construct real measurements. The first is to create a 
quantity index, as described earlier (i.e., a weighted average of the quantities 
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of the type of good or service one is interested in). The second is to create a 
price index and then diride die nominal value by die price index. The result 
is real and follows direcdy from die identity above: Q = V/P. The data for real 
GDP in die U.S. National Income and Product Accounts are created using 
the second approach. 

Price Indices 1/erst/s Implicit Price Deflators 

There are similarly two ways to construct price measurements. The first 
is to create a price index as described earlier (i.e., a weighted average of 
die prices of die type of good or sendee one is interested in). The second 
is to create a quantity index and then divide die nominal value by die 
quantity index. The result is called an implicit price deflator, since it is 
die price index implied by the identity, which is expressed in this case as 
P = V/Q. Note that published price indices are multiplied by one hundred. 

You might think that a price index and the coiTesponding implicit price 
deflator would be the same. However, though we tend to use them equiv¬ 
alently, only under extreme technical assumptions will that be die case. 6 

How Inflation Distorts Nominal Values 

Because prices tend to increase over time, comparing nominal mea¬ 
surements can be misleading. For example, if you receive a 10 percent 
salary increase from one year to next, while at the same time die prices of 
the tilings you typically buy experience 10 percent inflation, you will not 
enjoy an increase in your standard of living equivalent to your nominal 
raise. In fact, your living standard, or more precisely your real income, will 
not have changed at all. This is die concept behind another way to de¬ 
scribe real measures: as nominal measurements adjusted (or corrected) 
for the effects of inflation. 

The way this “inflation adjustment” is done is by recalling that GDP is a 
weighted average of Q, where the Ps are prices in a specific period, called 
the base year. By using base-year prices but actual-year quantities, real GDP 
excludes the effects of changing prices over time. (Note that if relative 
prices change significantly, that can cause a bias in the measure.) 

Rebasing Data Series 

Periodically, those who collect the data change the base year. One rea¬ 
son they do this involves how one interprets the base year from tire point 
of view of the index number. That is, the base year is die period against 
which the index measures changes in prices or quantities. We care more 
about recent economic changes than historic ones, so the base year is 
changed to keep it “recent.” A second reason for changing tire base year is 
to correct the bias mentioned in the last section. 

/'SSSs, . 

|I2- 



NOTES FOR NOVICE RESEARCHERS 

Practical Data Tip 

Any nominal variable can be transformed into its corresponding real 
magnitude by dividing by an appropriate price index: Q = V/P. An 
easy way to see this is by examining Tables B-l, B-2, and B-3 of the 
Economic Report of the President, which show nominal GDP, real GDP, 
and the GDP deflator, respectively. 7 

Similarly, if one has data for nominal and real (for example, export 
sales and export volumes), one can derive a price index: P = V/Q . 

Note that such an index is called a unit value. 



Re-basing a data series can cause difficulties for researchers. This is 
particularly a problem with print data sources. When you compile data 
from print sources, you should always start with the most recent source 
and work backward. This will enable you to capture revisions in the data 
that were not available until a subsequent date. For example, suppose you 
are compiling an annual data series, and each year your source reports 
the estimated figure for the current year, and also revised data for the pre¬ 
vious two years. If you start from the earliest source and go forward, your 
data series will consist of the earliest and least reliable estimates of the 
variable. 

Suppose you have a time series of annual CPI data in which the first half 
of the data uses a base year of 1992, while the second half uses a base year 
of 1997. There is no perfect way to link the two series! Every way imposes 
some bias, but that is the nature of working with real-world data. To link 
the two parts of the series it is necessary to have some overlap, as shown in 
Figure 9.7 for the year 1995. The first row shows die source data with abase 
year of 1997. (This should be clear since the 1997 value for the data is 100.) 
The second row shows the source data with a base year of 1992. Notice in 
particular that if we merely combined the two series there would be a no¬ 
ticeable jump where the two series connect. In short, the data would show 
an incorrect change in the CPI in that time period. For example, if you 
combined the base 1992 data through 1994 widi the base 1997 data be¬ 
ginning in 1995, you would see a large apparent decrease in the CPI be¬ 
tween 1994 (i.e., 1.30) and 1995 (i.e., 0.85). This would indicate deflation, 
when in fact prices are rising! 
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Figure 9.7 

Rebasing a Data Series 





Year: 

1991 1992 

1993 1994 

1995 1996 

1997 

1998 

Base Year 
1997 



0.85 

0.95 

1.00 

1.15 

Base Year 
1992 

0.95 1.00 

1.25 1.30 

1.40 




Linked 

Series 

0.58 0.61 

0.76 0.79 

0.85 

0.95 

1.00 

1.15 


Suppose we want a complete series with a 1997 base year. We will need 
to transform the values for the observations that are only available with a 
base year of 1992 so they correctly show the change in CPI between both 
parts of tire source data. The data with the earlier base year need to be re¬ 
duced in value to correctly link to the data with tire later base year. The 
amount of the reduction is given by the ratio of the two values for 1995, 
the period of overlap. In other words, dividing tire base-year 1997 value 
into the base-year 1992 value yields the reduction factor: 85/140 = 0.607. 

To obtain the linked series with a 1997 base year, use this factor to mul¬ 
tiply tire value of each observation with base-year 1992. The result is shown 
in the third row of Figure 9.7. 8 Notice that there is no longer a drop in the 
series where the two parts were linked. 

Data Smoothing 

Data that are volatile are sometimes “smoothed” to better reveal the un¬ 
derlying trends. Consider the monthly income of a salesperson who works 
entirely on commission. One month his income might be quite high, while 
the next it could be low. That variability makes it difficult for the salesper¬ 
son to tell if he’s having a good year or not. The problem is similar for re¬ 
searchers using certain types of data. 

There are a variety of techniques for smoothing data. We will discuss 
two such techniques here: moving averages and seasonal adjustment. A 
moving average replaces the actual data point in each period with an av¬ 
erage of the n - 1 preceding data points with the nth. The result is that any 
“abnormal” observations become less important since they are averaged 
with more normal ones. 

Some variables show seasonal patterns, whereby they change pre¬ 
dictably in certain months, quarters, or seasons. For example, consumer 
spending rises in the fourth quarter of every year in anticipation of Christ- 
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Data Manipulation 


mas. Often, researchers wash to know if the changes are abnormal. Thus, 
one might ask, did consumers spend more or less than normal this Christ¬ 
mas season? This is a question for which seasonally adjusted data are de¬ 
signed. The U.S. Census Bureau has developed techniques that are 
commonly used to seasonally adjust data. The most recent version is called 
X12. This version of the earlier XI1 technique is widely available on com¬ 
mon statistical packages or it can be obtained free from tire Census Bu¬ 
reau. Note that many macro data series are only available in seasonably 
adjusted form. These include GDP, CPI, PPI, retail sales, and housing 
starts. It is unlikely that you will need to do your own seasonal adjustment 
of data, but it is important that you understand what the term means. Note 
the following abbreviations, which are often used with data sources: 

NSA: Not seasonally adjusted. 

SA: Seasonally adjusted. 

SAAR: Seasonally adjusted annual rate. 

Appendix 11A discusses some additional ways to manipulate data. 
Whatever data manipulation you do, once you have your completed data 
set it is good practice to review the data. Examine each data series by look¬ 
ing at either data tables or charts. Look for observations that appear to be 
errors. This is easiest with time-series data that show a trend. A good ex¬ 
ample is tire CPI, as shown earlier. Just because data is published does not 
mean that is error free. Moreover, if you performed any data entry, there 
is always tire possibility you made a keystroke error. For cross-section data 
you should compute summary descriptive statistics: mean, range, and stan¬ 
dard deviation for each variable. You can report these as part of your data 
appendix. This is not generally done for time-series data, which often con¬ 
tain a time trend. Since the purpose of the descriptive statistics is to give 
you a rough sense of what tire data are like, with time-series data one would 
be likely to discover that the lowest data point was the earliest, while tire 
highest data point was the latest. With a time trend, you already know that. 

Only when you have examined the data and satisfied yourself that they 
are as accurate as they can be are you ready to begin statistical testing. 

Constructing a Data Appendix to Your Research 

It is good scientific practice to make your data available so drat other re¬ 
searchers may replicate your work. At a minimum, this means explaining 
the “Sources and Methods” you used in compiling your data set. We can 
categorize data as raw (or source) data or derived data. Raw data are the 
data as you found it at the source. Derived data are tire data that you actu¬ 
ally used in your empirical testing. This distinction is important because it 
is not uncommon for the raw data you obtain to be in a form different 
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from what you need to test your hypothesis. For example, the data avail¬ 
able from the source might be the consumer price index, while the data 
you need might be die inflation rate, that is, the percentage change in CPI. 
Thus, you may need to manipulate or mathematically transform tire data 
before using it. In this example, the CPI would be the raw data, while the 
inflation rate would be the derived data. 

In explaining your sources and methods you should provide clear cita¬ 
tions for exactly which sources provided the raw data, as well as a complete 
explanation of how you manipulated the raw data to put it in tire form you 
actually used. After reading your documentation, the reader should be 
able to replicate your exact data with a minimum of digging. Better yet, 
you should provide a copy of the exact data you used, in addition to doc¬ 
umenting where it came from. A number of journals require that authors 
include copies of their data when they submit their research papers for 
publication. 9 This copy of your data with documentation is called a data 
appendix. 

SUMMARY 

• Since the quality of your research will be no better than the quality of 
your data set, you should put a substantial effort into constructing a good 
quality data set. 

° To efficiently locate data you should begin with a search strategy. Begin 
by creating a list of the desired variables for your research. Next, con¬ 
sider likely sources for that data. Think about what proxies might be 
available for data that are unavailable. Finally, consider how the model 
might be revised to use the available data. 

• Data are available in different forms, including levels, per capita, 
changes, rates of change, and proportions. 

0 Growth rates for periods of less than a year are often given in annual¬ 
ized form. 

® Nominal data are distorted by inflation; real data are corrected for 
inflation. 

° An index number is essentially a weighted average of whatever the num¬ 
ber purports to measure. Thus, the consumer price index (CPI) is a 
weighted average of the prices of goods and services consumers typically 
buy. Index numbers are designed to show changes in prices or quantities 
since some arbitrary base period. 

• A data appendix prorides the exact data the researcher used as well as a com¬ 
plete description of the sources and methods he or she used to obtain it. 


Suggestions for Further Reading 


IMOTES 

1. Monthly values of industrial production were averaged to obtain a quar¬ 
terly series. Real GDP is available quarterly. Correlation will be ex¬ 
plained in detail in Chapter 10. 

2. The purchasing power parity (PPP) equivalent gets its name from the 
fact that it measures currencies in terms of the goods they can buy. PPP 
is based on the notion that in equilibrium tire same product should cost 
the same amount in two different currencies when adjusted by the ex¬ 
change rate. This should be time in equilibrium; if these costs are not 
the same, it would be profitable to export from where it is cheaper and 
import to where it is more expensive until the costs are equal. There¬ 
fore, PPP is a long-term equilibrium measure of the exchange rate. 

3. Thoughtful readers may notice that this is tire same value as nominal 
GDP. This point will be clarified shortly. 

4. This is exactly what is meant by characterizing money as “a unit of account.” 

5. For some price indices the weights are budget shares rather than quan¬ 
tities; in other words, tire weight of some consumer product price is the 
proportion of that item in the average consumer’s budget. 

6. For similar reasons, a quantity index and the corresponding real quan¬ 
tity are rarely equal, though this is easier to see because the units are dif¬ 
ferent. Real quantities are measured in constant (e.g., 1992 dollars), 
while quantity indices like all index numbers are unit free, that is, they 
are numbers like 100, 117, 129. This will be explained in more detail 
later in this chapter. 

7. Note, however, that for the math to work out correctly the data for the 
GDP deflator must be divided by 100. 

8. The careful reader will notice that this procedure for rebasing index 
numbers is a variation on the formula for creating index numbers pre¬ 
sented earlier. 

9. Recall that the last section of data in tire Resources for Economists web¬ 
site was data from published journal articles. 

SUGGESTIONS FOR FURTHER READING 

DataBasics, Federal Reserve Bank of Dallas website, http://dallasfed.org/data/ 
basics/index.html —A concise summary of the major techniques for ma¬ 
nipulating data. 

Wyrick (1994) —Offers explanations of a variety of data manipulation 
techniques. 
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EXERCISES 

1. Suppose you are researching the question of why die U.S. unemploy¬ 
ment rate for African Americans is higher than die U.S. average unem¬ 
ployment rate. Create a list of variables that you diink might explain 
diat question. For each variable, propose a source for that data. Next, 
visit each source, and describe the data you found. For example, if diei e 
are time-series data, what frequency is available and for what time pe¬ 
riod? If there are cross-section data, what level of analysis is available? 
Draw a conclusion as to whether die data is adequate to test the hy¬ 
pothesis empirically. 

2 Go to die Bureau of Economic Analysis Fixed Assets webpage at http:// 

' www.bea.doc.gov/bea/dn/faweb/AUFATables.asp#Sl. Download data 
for Current-Cost Net Stock of Fixed Assets and Consumer Durable 
Goods (Table 1.1) for the past ten years available. Open the down¬ 
loaded file in a spreadsheet. Compute the year-to-year growth rate for 
nonresidential fixed assets. This is die capital stock of private business 
in the United States, in odier words, Kin the standard production func¬ 
tion Q = f [ L, K, t ]. Is there a trend in the growth rates? Are they in¬ 
creasing, decreasing, or staying about the same? 

3. Go to die Census Bureau’s webpage on U.S. Trade Balances by Country 

at http://www.census.goy/foreign-trade/balance/index.html. What 

were U.S. imports from Mexico, Canada, Japan, and China in 2001? 

Next, go to http://www.census.gov/foreign-trade/statistics/historical/ 

gands.txt to obtain total U.S; imports in 2001. Finally, compute the pro¬ 
portion of total U.S. imports in 2001 that came from Mexico, Canada, 
Japan, and China. 

4. Collect data on quarterly nominal U.S. GDP for the past five years avail¬ 
able. One source of this data is the Bureau of Economic Analysis’s web¬ 
page at http://www.hea.doc.gov/bea/dn/nipaweb/TableYiewFixed. 

asp?SelectedTable=3&FirstYear= 2002&LastYear= 2003&Ffeq=Qtr. 
Next, collect data for the U.S. implicit price deflator for GDP for the 
same time period, at the same webpage. Now, deflate the nominal GDP 
using the GDP deflator to get real GDP. Please show your work. Finally, 
to check your work, collect data for real GDP at the same BEA webpage. 
Compare your computed real GDP with the data you downloaded. 

5. For each of the variables used in Exercise 1, proride complete citation 
information on how you obtained the data. 
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What Are Chained-Dollars? 


In the chapter, we defined an index number as a weighted average. For 
example, a quantity index for real GDP would be a weighted average of the 
quantities of all products in GDP, where the weights are tire prices of those 
products. To be more precise, most index numbers are constructed as the 
ratio of two weighted averages. In other words, a quantity index for real 
GDP would be tire ratio of average production in the current period to av¬ 
erage production in the base period. For a quantity index, the natural 
choice for weights is tire corresponding prices, but which prices? Tradi¬ 
tionally, there were two choices: the base period prices or the current pe¬ 
riod prices. If tire former was chosen, the result was called a Laspeyres index, 
if the latter was chosen the result was called a Paasche index. For example, 
the consumer price index is computed using base period quantities as 
weights; thus, the CPI is a Laspeyres index. Each of these choices intro¬ 
duces a bias into the results. The bias of the Laspeyres index is in the op¬ 
posite direction of the bias in the Paasche index. In recent years, a new 
weighting scheme has been introduced by tire U.S. Bureau of Economic 
Analysis to deal with this issue. The result is called a chain-type index, which 
is defined as the geometric mean of the Paasche and Laspeyres indices. 
Note that unlike the fixed weights of a Laspeyres or Paasche index, for a 
chain-type index the weights change for every observation. 
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Example of a Data Appendix 

Military Enlistment = f(Unemployment Rate, Military/Civilian Pay Ratio, 
War Status) 

Air Force, Army, Marines, Navy— Number of active duty enlisted soldiers 
in each of the respective branches of the U.S. military 

U-Rate_The national Unemployment Rate averaged from monthly ob¬ 

servations to yield annual rates 

Pay Ratio—Ratio of military to civilian pay. Military Pay is measui ed as the 
monthly basic pay for active duty enlisted men and women of classifica¬ 
tion E-3 with two or fewer years of service. These monthly observations 
were divided by 160 to represent a 40-hour workweek. Civilian Pay is de¬ 
fined as tire hourly earnings of civilian production workers. Monthly ob¬ 
servations were averaged to produce annual averages. 

War_This is a dummy variable with l’s for the years during which tire 

United States had a major foreign conflict. 


YEAR 

AIR FORCE 

ARMY 

marines 

NAVY 

U-RATE 

PAY RATIO 

WAR 

1981 

570,302 

781,419 

190,620 

540,219 

7.616667 

0.540545 

0 

1982 

582,845 

780,391 

192,380 

552,996 

9.708333 

0.531489 

0 

1983 

592,044 

779,643 

194,089 

557,573 

9.6 

0 

1 

1984 

597,125 

780,180 

196,214 

564,638 

7.50833 

0.512308 

1 

1985 

601,515 

780,787 

198,025 

570,705 

7.191667 

0.517612 

0 

1986 

608,199 

780,980 

198,814 

581,119 

7 

0.521581 

0 

1987 

607,035 

780,815 

199,525 

586,842 

6.175 

0.524918 

0 

1988 

576,446 

771,847 

197,350 

592,570 

5.491667 

0.518359 

0 

1989 

570,880 

769,741 

196,956 

592,652 

5.258333 

0.51926 

1 

1990 

535,233 

732,403 

196,652 

579,417 

5.616667 

0.517419 

1 

1991 

510,432 

710,821 

194,040 

570,262 

6.85 

0.522679 

1 

1992 

470,315 

610,450 

184,529 

541,886 

7.491667 

0.531482 

1 

1993 

444,351 

572,423 

178,379 

509,950 

6.908333 

0.537681 

1 

1994 

426,327 

541,343 

174,158 

468,662 

6.1 

0.535501 

1 

1995 

400,409 

508,559 

174,639 

434,617 

5.591667 

0.534311 

0 

1996 

389,001 

491,103 

174,883 

416,735 

5.408333 

0.529458 

0 

1997 

377,385 

491,707 

173,906 

395,564 

4.941667 

0.52527 

0 
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1998 

367,470 

483,880 

173,142 

382,338 

4.5 

0.518798 

1999 

360,590 

479,426 

172,641 

373,046 

4.216667 

0.518653 

2000 

355,654 

482,170 

173,321 

373,193 

3.975 

0.522991 

2001 

353,571 

480,801 

172,934 

377,810 

4.758333 

0.522497 

2002 

368,251 

486,542 

173,733 

385,051 

5.775 

0.544941 


Data Sources: 

Bureau of Labor Statistics, Data Series LNS 14000000: Seasonally Adjusted 
National Unemployment Rate, http://data.bls.gov/PDQ/servlet/ 
SurveyOutputServlet?data_tool : =latest_numbers&series_id=LNS1400 
0000 (accessed November 15, 2004). 

Used for U-Rate 

Bureau of Labor Statistics. Data Seiies CEU0500000006: Average Hourly 
Earnings of Production Workers, http://data.bls.gov/PDQ/servlet/ 
SurveyOutputServlet;jsessionid=f0307d'7548c5$3F7$3Ft (accessed No¬ 
vember 15, 2004). 

Used for Pay Ratio 

Defense and Accounting Service. Military Pay Prior Rates, http://www.dfas.mil/ 
money/milpay/priorpay/ (accessed November 16, 2004). 

Used for Pay Ratio 

Department of Defense. DoD Active Duty Military Personnel Strength Levels, 
Fiscal Years 1950-2002, http://wvw.dior.whs.mil/mmid/military/ms9.pdf 
(accessed November 12, 2004). 

Used for Air Force, Army, Marines, Navy 

Military.com. 2004. Military.corn’s Military History: The Complete List, 
http://www.military.com/Resources/HistorySubmittedFileView?file= 
history_completelist.htm (accessed November 12, 2004). 

Used for War 

Source: Data appendix courtesy of David Hutchinson. 
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"No amount of experimentation can ever 
prove me right; a single experiment can 

prove me wrong ." 

ALBERT EINSTEIN 


[iff hroughout this book we have argued that research involves cre- 
ating a persuasive argument and that researchers persuade by 
H providing theoretical and empirical evidence that supports their 

EJlilSthesis. This chapter and the following one will explain how to de¬ 
velop convincing empirical evidence. 

In this chapter, we introduce the concept of research design, or how to 
create a valid empirical test of a hypothesis. We survey three types of in¬ 
creasingly sophisticated empirical analysis: casual empiricism, simple sta¬ 
tistical hypothesis testing, and multiple regression. Then we discuss 
common problems of data analysis that weaken or limit the validity of 
these empirical tests. We conclude by explaining why regression analysis is 
the empirical tool of choice for economists. In Chapter 11 we survey re¬ 
gression analysis in detail. 

Key Issues of Research Design 

Okay. You have an interesting research question. You’ve reviewed the 
literature on the topic area. You have analyzed the issue using economic 
theory, and from that theoretical analysis you have derived your hypothe¬ 
sis, that is, your proposed answer to the research question. Next, you need 
to test the hypothesis. In other words, you must assess whether or notyour 
answer to the research question is correct. How do you design an appro¬ 
priate test? There are a number of key issues to consider. 






Key Issues of Research Design 


Two general types of empirical methodology are available to re¬ 
searchers: experimental and survey (or nonexperimental) methods. Ex¬ 
perimental methods are illustrated by “laboratory” experiments. For 
example, the researcher might examine two sample groups, an interven¬ 
tion or treatment group, and a control group. The control group is de¬ 
signed to be in every way identical to the treatment group except for the 
treatment. Thus, if the experiment between the two groups yields differ¬ 
ent outcomes, then that difference can be ascribed to tire treatment. 

Suppose a professor teaches two sections of the same course. The two 
sections have tire same lectures, assignments, and examinations, but in 
one section the instructor is trying a new textbook. If that section receives 
higher grades in tire course, it could indicate that the new text was more 
effective than the other text. Of course, other possibilities need to be con¬ 
sidered as well. It may be that students in tire more successful section were 
brighter or more experienced in economics. Thus, in a well-designed ex¬ 
periment, the students would need to be randomly distributed between 
the two sections to control for outside factors. 

Traditionally, research in economics has used survey or nonexperi¬ 
mental methods. (In recent years, however, experimental economics has 
been growing in importance). Survey research involves the passive obser¬ 
vation and analysis of events as they occur in nature. For example, instead 
of the Federal Reserve inducing a recession, and then analyzing the out¬ 
comes, Fed economists use data on recessions as they have occurred in his¬ 
tory to investigate their causes and effects. Survey research uses statistical 
methods to account for outside factors. This will be discussed later in this 
chapter in detail. 

A critical factor in designing an empirical study is the degree to which 
the methodology is valid. What makes for a valid empirical research de¬ 
sign? The validity of a study has multiple dimensions. At the broadest level, 
one needs to consider internal and external validity. 

A study has internal validity if the impact observed can be attributed to 
the study variable, in other words, if given the assumptions and evidence 
one can deduce that X causes Y. 1 Returning to our teaching experiment, 
if die two class sections are identical except for the textbook used, we can 
logically conclude that the difference in grades is attributable to the 
choice of text. In short, die experiment has internal validity. 

There are several elements to consider when assessing internal validity. 
The three most important are instrument validity, relationship validity, 
and causal validity. 2 Instrument validity asks whetirer the test instrument 
adequately measures what it purports to. For example, it is not uncommon 
for researchers to have difficulty finding ideal data. Are the available data 



sufficient for die task? Do they serve as adequate proxies for the underly¬ 
ing theoretical concepts? Studies of teaching techniques often use grades 
to measure the amount of learning that is occurring. Are grades a valid in¬ 
strument for measuring learning? 

Relationship validity asks how conclusive the empirical testing was. Was 
die test appropriate and reasonable? In short, can one conclude based on 
die empirical test drat diere is in fact a statistical relationship? Suppose 
one examines data on energy prices and energy consumption for the 
decades before and after die Energy Crisis of the 1970s. The price of en¬ 
ergy increased dramatically, while energy consumption increased as well. 
Does that imply that the law of demand does not hold for energy? Casual 
examination of data on energy prices and consumption lacks relationship 
validity because it fails to control for other factors that may have affected 
energy consumption; in other words, die ceteris paribus assumption 
(“odier things being equal”) does not hold. In the case of the energy cri¬ 
sis, GDP was increasing over the same period, so that actual energy con¬ 
sumption rose despite the higher prices, not because of them. 

Causal validity observes that since correlation does not imply causation, 
can one be sure that the hypothesized causal relationship is valid? Can one 
be sure that the causation does not go in die opposite direction, or that 
there is no causation at all, merely a statistical association? For example, 
consumer spending and personal incomes are highly correlated. Can one 
conclude tiiat the spending is the result of die income, or the converse? 
Does yacht ownership cause people to be rich, or does the causation run 
in die odier direction? These questions rarely have easy answers, but care¬ 
ful researchers will ask them nonedieless. 

Once you have established the internal validity of a given study, you 
need to ask if die results can be generalized to odier situations, applica¬ 
tions, or circumstances. If so, die study has external validity. For example, 
suppose a study found a positive relation between class attendance and 
GPA for students at small liberal arts colleges. Would the same results hold 
for courses at a research university taught in large lecture classes and us¬ 
ing multiple-choice exams for assessment? Perhaps not. If so, we would 
conclude that the original study has external validity; if not, we would draw 
the opposite conclusion. 

Note that there tends to be a U'adeoff between internal and external va¬ 
lidity. As researchers tinker with their model to obtain good statistical re¬ 
sults, those results have tire tendency to become specific to that case and 
less generalizable to other situations. 

The purpose of empirical testing is to search for evidence in the data 
that supports your hypotiresis. But a good empirical test does more than 
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that. It also rules out alternative hypotheses. Your research attempts to de¬ 
velop an argument by using logic and evidence in favor of your hypothe- 
sis. The problem is that the data in the real world may be consistent with 
alternative hypotheses as well as with yours. To be successful, you must 
select an empirical test that adequately disciiminates between your hy¬ 
pothesis and alternatives. Statisticians define the power of a test as the 
probability of correctly rejecting a null hypothesis when it is not true. So 
when you select your empirical test, you should ask yourself the following 
question: if the test yields die strongest possible statistical results, how con¬ 
fident can I be that my hypothesis is confirmed? In other words, will the 
test adequately discriminate between your hypothesis and alternatives? If 
not, then you should consider a more powerful test. 3 

Consider the following example. In the early 1980s, the Reagan Ad¬ 
ministration succeeded in reducing personal income taxes by 25 percent. 
Following tire tax cuts, the U.S. economy grew for the remainder of the 
decade, and this growth was trumpeted as a successful example of supply- 
side economics. But was it? Two decades earlier, the Kennedy tax cuts and 
resulting economic expansion were heralded as evidence of the effective¬ 
ness of Keynesian economics. One might ask which hypothesis was cor¬ 
rect—were the Reagan tax cuts evidence of a supply-side expansion or a 
Keynesian one? 

A key prediction made by the supply-side theory is that tax cuts, by in¬ 
creasing the after-tax return consumers gain by saving, should increase 
their saving rate. Examining the actual saving rate after the Reagan tax 
cuts would be a way to test this theory. In fact, however, such an examina¬ 
tion is not a very strong test, since the Keynesian theory also predicts an in¬ 
crease in the saving rate (or individual’s average propensity to save). The 
supply-side theory hypothesizes that the higher saving rate stimulates the 
economy, while the Keynesian theory postulates that the stronger econ¬ 
omy promoted by tax cuts raises the saving rate. As a consequence, the em¬ 
pirical test of looking for an increase in the saving rate cannot help us 
discriminate between the two competing theories. 

As you design, conduct, and interpret your empirical testing, you will 
need to keep these validity issues in mind. We will discuss them again in 
the next section in the context of specific testing methodologies. 

How Does One Analyze Data? 

When a researcher begins to think about empirical testing, he or she 
starts by asking the question: What are the implications (or predictions) of 
my theoretical analysis? If the hypothesis is true, what evidence should one 
expect to see? The answer to these questions is called the theoretical pre- 
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diction of the analysis. Think about how doctors look for symptoms to 
identify an illness. A woman visits the doctor’s office, saying she thinks she 
has tire flu. The doctor thinks to himself: “If she has the flu, she should ex¬ 
hibit tire following symptoms: headache, fatigue, fever.” So the doctor asks 
tire patient: “What are your symptoms?” It is the symptoms that allow the 
doctor to make a diagnosis. In the same way, it is tire predictions of the 
theory that allow the researcher to test his or her analysis. 

We looked at several examples of this process in Chapter 7 . The first ex¬ 
ample involved a study of tire demand for diamond jewelry. The analysis 
predicted a negative relationship between price and quantity demanded. 
Another example asked whether parental work hours adversely affect chil¬ 
dren’s cognitive development. If so, work hours and cognitive develop¬ 
ment should be negatively related. 

Once the predictions of the theory are identified, the researcher next 
asks: Is the evidence of the real world consistent with these predictions? 
How exactly does tire researcher “examine” the evidence to answer this 
question? We can differentiate between three methods: casual empiricism, 
simple hypothesis testing, and multiple regression analysis. 

Casual Empiricism 

Casual empiricism is the type of analysis one does when constructing a 
data appendix. It includes printing and graphing data, computing simple 
descriptive statistics, and visually examining tire results. Because it is sim¬ 
ple and easy to perform, tire researcher can often learn much from this ap¬ 
proach, especially when initially looking for hypotheses. 

For example, the Phillips curve is the hypothesis that there is an ixrverse 
relation between tire rates of inflation and unemployment. Figure 10.1 
shows the data for these two variables during the 1960s, when the Phillips 
curve relationship gained prominence. Upon inspection, you ,can easily 
see the discernible negative relationship. This important macroeconomic 
relationship was discovered through casual empiricism. 

Note that once you have identified a hypothesis, you should collect a 
new set of data to test it. You should never test tire hypothesis with tire same 
data you used to develop it. 

Airy sample of data can be described in terms of its descriptive statistics. 
You can think of descriptive statistics as statistics that summarize the un¬ 
derlying data. Descriptive statistics include measures of central tendency 
and measures of dispersion. A measure of central tendency is the answer to 
tire question: if you had to summarize a sample of data using only one mea¬ 
sure, what would you use? The answer is an average. There are three com¬ 
monly used measures of central tendency or averages. These are the mean 
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Figure io.i Phillips Curve 
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or arithmetic average, the median or middle value in die sample, and die 
mode or most common or frequendy occurring value in the sample. 4 

How well an average summarizes a data sample depends on the disper¬ 
sion of the data. There are three commonly used measures of dispersion. 
The first is the range between die highest and lowest values in die data 
sample. The second is die standard deviation, which roughly measures the 
average amount by which a data point differs from die mean value of the 
sample. The variance, die diird commonly used measure of dispersion, is 
the squaie of die standard deviation. Any spreadsheet or statistical soft¬ 
ware package will generate descriptive statistics for a data set. 

Most students are familiar with die standard bell-shaped or normal dis¬ 
tribution of data. Researchers often assume that their data are normally 
distributed or approximately normally distributed. If so, all three mea- 
suies of cential tendency will be about die same. If not, the values for die 
mean, median, and mode of a data sample can differ substantially, so you 
need to caiefully choose which measure of die average to report. Data for 
peisonal income is a good example of tins. Personal income data tend not 
to occui in the typical shape of a bell curve. Ratiier, there are enough high 



incomes at die top of die distribution to skew the mean, so die median is 
a better reflection of the typical income. 

In Chapter 9, we noted tiiat computing descriptive statistics for time- 
series data is not always helpful when those data increase or decrease over 
time. In these cases, one can look at average growth rates over time, in¬ 
stead of average values. Suppose we return to die Energy Crisis example. 
Over the time period in question one could ask, what were the percentage 
changes in energy prices and energy consumption? From 1973 to 1980, for 
example, the real price of energy increased by 91 percent while energy 
consumption per dollar of GDP decreased by 23 percent. This implies a 
negative relationship, as die law of demand suggests. 

You can more formally assess the relationship between two variables by 
examining the covariance. The covariance is a measure of how two vari¬ 
ables vary together. It is related matiiematically to the standard deviation 
and die variance. Unfortunately, die covariance is difficult to explain in¬ 
tuitively. For diat reason, researchers often use a closely related concept, 
correlation. Correlation measures the degree of linear association be¬ 
tween two variables. The con-elation coefficient, which ranges in value 
from —1 to +1, measures botii die direction and strengdi of the linear re¬ 
lationship. Suppose we graphed energy prices against energy consump¬ 
tion, and drew a line to best fit die data points. The slope of the line would 
show the (hypothesized) linear relationship between prices and con¬ 
sumption. The slope would be positive or negative. The strength of the re¬ 
lationship would be reflected by the dispersion of die data points—die 
degree to which die data points are close to or far from the line. If die av¬ 
erage dispersion is small, then most of the data points will be found “close” 
to the line. In this case, we say the correlation between prices and con¬ 
sumption is high. The higher the correlation, the closer the coefficient is 
to +1 for positive relationships or —1 for negative relationships. If the av¬ 
erage dispersion is large, then most of the data points will be “far” from 
die line. In this case, we say the correlation between prices and consump¬ 
tion is low. The lower the correlation, the closer the coefficient is to zero. 
For example, the correlation coefficient between real energy prices and 
consumption per dollar of GDP over diis time period was —0.95, which in¬ 
dicates a strong negative linear relationship. 

Random Variation in Human Behavior 

Data analysis is subject to diree major problems. These problems are, 
first, the effects of random variation in human behavior; second, the fact 
that relationships between two variables can be concealed by the effects of 
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other variables; and, third, causal validity, tire fact drat correlation is not 
die same as causation. We will explore each of these issues in tire following 
sections. 

One of the difficulties faced by research in tire social sciences is random 
variation in human behavior. This poses a problem for relationship valid¬ 
ity because the effects of random variation can obscure any underlying re¬ 
lationships. Let’s illustrate this with an example. Suppose that consumer 
spending (C) is based systematically only on income (Y). Algebraically, this 
is represented as follows: 

C = bY, (1) 

where b is a positive constant showing the relationship between C and Y, 
drat is, the slope of the graph of C against Y. 

Additionally, however, suppose that in any time period individuals in 
fact spend a random amount more or less that that. As a consequence, con¬ 
sumer spending is actually 

C = bY + e, (2) 

where e reflects the random variation in spending. 

If we plot the data, they might look something like Figure 10.2. 


jFigure 1 0.2 Sample of Data for Consumer Spending and Income 
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If we imagine a line through tire points in Figure 10.2, it would have a 
negative slope, indicating a negative relationship between C and Y. But are 
they negatively related? Most social science statistics are based on sample 
data rather titan population data. The key question to ask about any data 
set is whether or not it fairly represents the underlying population data. 
Recall that in Chapter 4 we pointed out that for an argument to be per¬ 
suasive the evidence presented must be representative. This is an example 
of that point. There are really two issues at stake here: First, is the sample 
large enough to validly show any underlying relationship in the data? Sec¬ 
ond, is the sample random? These issues are related, but the second is the 
more critical. 

Suppose consumption and income are positively related, as de¬ 
scribed by Equation 2. If we take a large random sample of the data, 
then any data point with a positive error is likely to be matched by an¬ 
other point with a comparable negative error. As a result, the errors will 
tend to cancel each other out, and the data will show the true underly¬ 
ing relationship. How large a sample is large enough? A common rule 
of thumb is that a sample size of thirty or more will do the trick. Strictly 
speaking, the sample should be thirty or more random data points. A 
truly random sample is difficult and expensive to acquire, so typically 
samples are less than random. 5 A random sample is also unlikely in 
macro research, where time-series data are the norm. In either case, 
bias is introduced into the results. 

So where does this leave us? It may be that the data in Figure 10.2 were 
not a random sample. Thus, the graph may look negatively sloped when 
the underlying relationship is the opposite. The reason for this anomaly is 
that some chance exists that any data sample will include a nonrandom se¬ 
lection of errors, so the data will appear to show a pattern that does not re¬ 
flect the underlying population. In other words, the sample of data may 
not be large enough or random enough for the errors to cancel each other 
out. As a result, consumer spending will appear to show a negative rela¬ 
tionship with income when drat appearance is really due to a chance se¬ 
lection of errors. This is called a sampling error. 

This point is illustrated in Figure 10.3, where the original sample is 
shown in solid dots, while the remainder of the population is shown in 
empty dots. Note that a line drawn through the population shows the cor¬ 
rect positive relationship between the two variables. 

In practice, the problem may be even worse than we’ve indicated. All 
data are subject to some degree of measurement error, which is an issue of 
instrument validity. This measurement error will also be reflected in the e 
term of Equation 2. 
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Statistical Methods 

The way to deal with random variation in human behavior is to incor¬ 
porate statistical methods to determine the likelihood of a sampling error. 
Let’s illustrate how this can be done by continuing the example from the 
last section, which hypothesizes a relationship between consumption and 
income (Equation 2). 

When employing statistical methods it is critical to differentiate be¬ 
tween two concepts: the null hypothesis and the alternative or maintained 
hypothesis. The maintained hypothesis is the theoretical prediction of a 
model. Generally speaking, however, when economists use statistical meth¬ 
ods to test a theory, they actually test the null hypothesis. For this reason, 
the null hypothesis is sometimes referred to as the statistical hypothesis, 
that is, the hypothesis that is actually being tested statistically. If the null 
hypothesis is rejected by the evidence, then that result supports or is con¬ 
sistent with tire maintained hypothesis. In Equation 2, we are estimating 
the value of b, tire relationship between income and consumption. Thus, 
tire null hypothesis, which is actually examined statistically, is that b is zero, 
while the maintained hypothesis is that b is not zero. 

Even if the null hypothesis is true, there is a chance that data for a given 
sample -will show a relationship so that an estimate of b will not be zero. In 


[Figure 10.4 Distribution of Observed "b "for All Possible Data Samples 
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fact, tire outcomes that we would observe from all possible samples of the 
data tend to be distributed along a bell curve, as shown in Figure 10.4. You 
should notice a few things about this bell curve. First, tire height of tire 
curve at any point reflects tire probability of that observation occurring. 
Second, the bell curve is symmetric around a value of b = zero, meaning 
no relationship. Thus, it should be clear that observations closer to zero 
(i.e., the null hypothesis) will be more likely to occur than observations 
that are further from zero. This makes sense since we are assuming that 
the null hypothesis is true. The further from zero that a given observation 
is, however, the less likely it is to appear. At some point, the probability is 
so small, the likelihood of observing it is so low, that we reject the null hy¬ 
pothesis in favor of the alternative that the observed relationship is not 
due to random chance, but rather reflects a real relationship. Such a re¬ 
sult would support the maintained hypothesis. 

How far away is enough? How certain do you need to be to reject the 
null hypothesis in favor of the alternative? The answer to this question is 
called the level of significance. The level of significance is the risk that the 
researcher is wiling to take that tire null hypothesis will be rejected when 
it is time. Another way to think about significance level is that it is tire prob¬ 
ability that tire researcher will accept tire hypothesis he or she is main¬ 
taining when it should be rejected—in other words, that the researcher 
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Type 1 and Type 2 Errors 

Statisticians formalize this discussion of statistical inferences by 
talking about Type 1 and Type 2 errors. Wrongly rejecting the null 
hypothesis is called a Type 1 error. To illustrate this, consider a 
person who is on trial for a robbery. In the U.S. legal system, the 
defendant is assumed to be innocent until proved guilty. Thus, the 
statisticians would say that the null hypothesis is that the defendant is 
innocent. Suppose she is indeed innocent, but that the jury convicts 
her anyway. In this case, they have committed a Type 1 error. 
Researchers can reduce the likelihood of a Type 1 error by using a 
smaller level of significance—that is, by requiring a higher standard of 
proof before accepting that a relationship is valid. Wrongly accepting 
the null hypothesis is called a Type 2 error. To illustrate this case, 
suppose that the defendant in our example is in fact guilty. If the jury 
finds her innocent, they would be committing a. Type 2 error. 
Researchers can reduce the likelihood of. a Type 2 error by increasing 
the sample size. 


wrongly accepts the maintained hypothesis. Most researchers are willing to 
use a 5 percent level of significance, though a 1 percent level would be 
even more convincing. To sum up, if you use a 5 percent level of signifi¬ 
cance, then you are willing to risk a 5 percent chance that you will reject 
the null hypothesis when it is true; consequently, there is a 95 percent 
chance that you will not reject the null hypothesis when it is true. 6 

Simple Statistical Hypothesis Testing 

Let’s illustrate simple statistical hypothesis testing with an example, us¬ 
ing a statistical “West.” Simple statistical hypothesis testing is one case of 
what is known more generally as statistical hypothesis testing. Suppose 
that we Want to know whether the mean value of some sample is signifi- 
candy different from some standard called the default mean. For example, 
we could ask if the average unemployment rate among high school 
dropouts is different from that of tire labor force in general. A statistical 
West can be used to answer this question. 7 A West examines whether the 
sample mean (in this case, the average unemployment rate among 
dropouts) is far enough away from the default mean (i.e., the average un¬ 
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employment rate among all people) to be considered statistically differ¬ 
ent. Let’s examine how this can be done. 

The first step is to compute the estimated t statistic using the following 
formula: 

t = (x — |i)/(s/Vn) (3) 

where 

o x is tire average unemployment rate among high school dropouts 
(i.e., the sample mean), 

• |x is the unemployment rate in the general labor force (i.e., the de¬ 
fault mean), 

® s is the standard deviation of the unemployment rate among high 
school dropouts (i.e., the sample standard deviation), and 

• Vn is the square root of n, the sample size of the data set. 

Note also that s/Vn is called the standard error of fire average unem¬ 
ployment rate among high school dropouts. 

The null hypothesis to be examined is that the mean unemployment rate 
among dropouts is not significantly different from that of the labor force in 
general. This is analogous to our hypothesis of “no relationship” in the pre¬ 
vious section. You select the level of significance that you are willing to toler¬ 
ate (e.g., 5% or 1%). Then you compare the estimated t with the critical t 
value for the desired level of statistical significance. 8 Exact critical t values can 
be obtained from tables in statistics books, such as that in Appendix 10A, or 
online (e.g., at http://www.statsoftinc.com/textbook/sttable.html). 

If the estimated t exceeds the critical t, then you can conclude that the 
probability of the sample mean being the same as the default mean is low. 
Thus, it is likely that any observed difference is not due to random chance; 
rather, the observed difference is likely to be real. 

Monthly data from January 1993 to October 2003 were obtained from 
the Bureau of Labor Statistics’ webpage (www.bls.gov) to construct this t- 
test. The data were downloaded into an Excel spreadsheet, and the fob 
lowing statistics were computed: 

• x (the average unemployment rate among high school dropouts) = 
8.26% 

• |x (the unemployment rate in tire general labor force) = 5.28% 

• s (the standard deviation of the unemployment rate among high 
school dropouts) = 1.37 

• n (the sample size of tire data set) = 130 monthly observations, so 
Vi = 11.40 

0 s/Vi (the standard error) =0.12 
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Using the formula given in Equation 3, we can construct our observed 
t statistic: 

t= (8.26 - 5.28J/0.12 = 24.83 

The critical t value for this test, using a 5 percent significance level, is 1.98. 
Since the observed t, 24.83, exceeds the critical t, 1.98, we conclude that 
high school dropouts have a statistically different unemployment rate than 
the general population. 

Wyrick (1994, 195) provides a good intuitive explanation of f-tests. He 
points out that a ^-statistic “is a ratio between the observed . . . difference 
[e.g., between unemployment rates] and the normal variations ... [in un¬ 
employment rates] caused by all factors.” Wyrick argues that “if the ratio 
is ‘large,’ then tire observed . . . difference exceeds the amount that one 
might attribute to chance fluctuations alone.” Thus, one can reasonably 
conclude that there is indeed a difference in unemployment rates. Alter¬ 
native, “if the ratio is “small” then the observed . . . difference is “small” 
compared to normal . . . fluctuations, so it would be unwise to conclude 
that the observed difference is meaningful.” 

An Alternative Approach: The P-Value 

An alternative approach to performing Wests is increasingly being used. 
This is called the p-value. The p-value is the exact or observed level of sig¬ 
nificance. More formally, assuming that the null hypothesis is true, the p- 
value is the probability that you would obtain an estimated t score that is 
as large or larger than the one you obtained, in tire direction of the main¬ 
tained hypothesis. If the observed probability is less titan the level of sig¬ 
nificance, the null hypothesis is rejected. 

One advantage of the p-value approach is that it doesn’t require con¬ 
sulting a table of critical t-values. Since die p-value approach is equivalent 
to the f-test, most statistical software packages provide p-values. 

Confounding Variables 

We stated earlier that the second problem of data analysis is the fact that 
relationships between two variables can be concealed by the effects of 
other variables. Statisticians call this problem tire confounding of ex¬ 
planatory variables. It affects both casual empiricism and simple Wests. 
Even with a random sample of data, it is possible to obtain a sample that 
looks like Figure 10.2. Suppose that the variable of interest (e.g., consumer 
spending in the previous example) is affected by something other than the 
hypothesized cause (i.e., income). Examples of plausible other variables 
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include interest rates or household wealth. Let’s amend our equation for 
consumer spending as follows: 

C = b]Y + b 2 i + e . (4) 

Now suppose that during the sample period interest rates noticeably de¬ 
creased, while income didn’t, hr this case, tire data would reflect the effects 
of interest rates on consumption more than the effects of income. This sit¬ 
uation is illustrated in Figure 10.5, where tire data sample is reflecting a shift 
in tire consumption function rather than a movement along the function. 
To accurately measure the relationship between consumer spending and 
income, we must control for any other variables, like interest rates, that 
have their own effect on spending. Without this control, visual or statisti¬ 
cal examination of the data can be flawed because the impacts of Y and l 
can’t be separated. This is exactly the basis for tire ceteris panbus ( all 
other tilings being equal”) assumption that economists commonly make. 

Thus a control variable is what is held constant in a study. Some¬ 
times, judicious test design can solve this problem. For example, studies 


Figure 1 o.s The Effects of Interest Rates on the Relation Between C and Y 
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Using the formula given in Equation 3, we can construct our observed 
f statistic: 

t= (8.26 - 5.28J/0.12 = 24.83 

The critical lvalue for this test, using a 5 percent significance level, is 1.98. 
Since tire observed f, 24.83, exceeds the critical f, 1.98, we conclude that 
high school di opouts have a statistically different unemployment rate than 
the general population. 

Wyrick (1994, 195) provides a good intuitive explanation of f-tests. He 
points out that a f-statistic “is a ratio between tire observed . . . difference 
[e.g., between unemployment rates] and tire normal variations ... [in un¬ 
employment rates] caused by all factors.” Wyrick argues that “if the ratio 
is large, then tire observed . . . difference exceeds the amount that oire 
might attribute to chance fluctuations alone.” Thus, one can reasonably 
conclude that there is indeed a difference in unemployment rates. Alter¬ 
native, if the ratio is small’ then the observed . . . difference is “small” 
compaied to normal . . . fluctuations, so it would be unwise to conclude 
that the observed difference is meaningful.” 

An Alternative Approach: The P-Va/ue 

An alternative approach to performing f-tests is increasingly being used. 
This is called the p-value. The p-value is tire exact or observed level of sig¬ 
nificance. More formally, assuming that the null hypothesis is true, the p- 
value is the probability that you would obtain an estimated f score that is 
as large or larger than the one you obtained, in tire direction of the main¬ 
tained hypothesis. If the observed probability is less than the level of sig¬ 
nificance, the null hypothesis is rejected. 

One advantage of tire p-value approach is that it doesn’t require con¬ 
sulting a table of critical f-values. Since the p-value approach is equivalent 
to the f-test, most statistical software packages provide p-values. 

Confounding Variables 

We stated earlier that the second problem of data analysis is the fact that 
relationships between two variables can be concealed by tire effects of 
other variables. Statisticians call this problem the confounding of ex¬ 
planatory variables. It affects botir casual empiricism and simple f-tests. 
Even with a random sample of data, it is possible to obtain a sample that 
looks like Figure 10.2. Suppose tirat the variable of interest (e.g., consumer 
spending in the pr evious example) is affected by something other than the 
hypothesized cause (i.e., income). Examples of plausible other variables 



include interest rates or household wealdr. Let’s amend our equation for 
consumer spending as follows: 

C = bjY + b 2 i + e ( 4 ) 

Now suppose that during die sample period interest rates noticeably de¬ 
creased, while income didn’t, hi this case, die data would reflect the effects 
of interest rates on consumption more than the effects of income. This sit¬ 
uation is illustrated in Figure 10.5, where the data sample is reflecting a shift 
in die consumption function rather than a movement along die function. 
To accurately measure the relationship between consumer spending and 
income, we must control for any other variables, like interest rates, that 
have their own effect on spending. Without this control, visual or statisti¬ 
cal examination of the data can be flawed because die impacts of Y and i 
can’t be separated. This is exactiy the basis for die ceteris paribus (“all 
other things being equal”) assumption that economists commonly make. 

Thus, a control variable is what is held constant in a study. Some¬ 
times, judicious test design can solve this problem. For example, studies 

;Figui|T0L5 The Effects of Interest Rates on the Relation Between C and Y 
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NOTES FOR NOVICE RESEARCHERS 

What's an Appropriate Empirical Test? 

The researcher should choose an appropriate empirical test that is 
based on the specific objectives of his or her research, considering 
both the costs and benefits of the testing methodology. This does not 
mean that the most complex or sophisticated methods are always the 
best. (Regression may not always be necessary!) 

As a general rule, you should use the simplest testing technique 
that adequately controls for outside factors. Most importantly, 
remember that the validity of your research design is never higher 
than the validity of any one part of that design. Thus, if your data are 
suspect, the most sophisticated statistical techniques cannot improve 
upon them. 


that confirms that analysis. Though suivey research cannot prove a hypothe¬ 
sis in the way that experimental research can, multiple survey-based studies 
using different data sets at different times by different researchers can pro¬ 
vide strong evidence of causality in a hypothesis. 

SUMMARY 

0 The purpose of empirical testing is to assess the validity of one’s theo¬ 
retical analysis of an issue or problem. 

* The validity of an empirical study has numerous aspects: internal versus ex¬ 
ternal validity, instrument validity, relationship validity, and causal validity. 

0 A study’s validity can be compromised by three hurdles of data analysis: 

1. Random variation in human behavior 
Solution: Tests of statistical significance 

2. Dependent variable can be affected by other than the hypothesized 
independent variable 

Solution: Control variables 

3. Correlation is not causation 

Solution (s): Sound theoretical development; Multiple studies with 
different methodologies 
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NOTES 

1. Note that when we say X “causes” Y we are assuming that all other pos¬ 
sible explanations for the observed outcome have been controlled for. 
This is a strong assumption since there is always the possibility that we 
failed to identify some possible explanation. We will discuss these issues 
m more detail later in the chapter. 

2. One should also consider tire reliability and sensitivity of the instruments. 

3. Oftentimes, however, lack of adequate data will limit your ability to use 

a more powerful test. This highlights again die importance of finding a 
good data set. 6 

4. For even-numbered sample sizes, the median is the number at which 
half the sample is above and half the sample is below. In other words it 
is the number halfway between the two middle numbers in the sample. 
For odd-numbered sample sizes, the median is exacdy the middle num¬ 
ber in the sample. 

5. Best (2001) points out that social science research often uses “conve¬ 
nience sampling,” that is, using the sample that comes to you rather 
than going to the trouble of developing a random sample. 

6. Note that this doesn’t say anything direcdy about when the null hy¬ 
pothesis is false! Statistical testing examines the null hypothesis that 
there is no statistical relationship. More precisely, it attempts to disprove 
the null hypodiesis. The level of significance defines how much confi¬ 
dence one can have that the null hypothesis has been correcdy rejected. 

It does not say anything direcdy about how much confidence one can 
place in the maintained hypothesis. 

7. For the West to be appropriate the data should be a normally distrib¬ 
uted, random sample. 

8. Critical lvalues depend on three factors: die desired level of statistical 
significance, whether die testis one tailed or two tailed, and the degrees 
of freedom. A one-tailed test would examine either of two hypotheses: 
the sample mean (x) is less than die default mean (p,), or the sample 
mean is greater than die default mean. By contrast, a two-tailed test 
would examine the hypothesis that the sample mean (x) is not equal to 
die default mean (|x). Thus, it is a more general test. The degree of free¬ 
dom is defined as n - 1, where nis the sample size. In our example, the 
degrees of freedom are 130 - 1 = 129. 



EXERCISES 

1. Collect twenty years of annual data for U.S. inflation rates and the 
growth rate of die money supply (Ml). For the inflation rate and die 
money-growth rate compute die mean, median, and mode, as well as 
the standard deviation and range of values. Plot die two variables on a 
graph. Do you see any relationship between die two variables? Compute 
die correlation between inflation and money growth. 

2. What are the key variables in your research hypodiesis, diat is, your de¬ 
pendent variable and die hypodiesized variables diat affect it? Collect 
twenty data points for each variable. Compute die mean, median, and 
mode, as well as the standard deviation and range of values for each 
variable. Plot the dependent variable with each key independent vari¬ 
able on a graph. Do you see any relationship between die two variables? 
Compute the correlation between the two variables. 

3. Is die unemployment rate in your state statistically different from die 
national unemployment rate over the past ten years? Construct a tf-test 
to assess this hypothesis. 

4. Think about your research hypothesis. What variables odier than the 
key variables diat you identified in exercise 2 might influence the de¬ 
pendent variable and tiius need to be controlled for in your empirical 
testing? 

SUGGESTIONS FOR FURTHER READING 

Best (2001) —Very tiioughtful monograph on how data series are con¬ 
structed and die ways diey can distort what they purport to measure. 

Huff (1954), especially Chapters 3, 4, and 8 —Excellent intuitive explanation 
of statistical significance. 

McCuen, Johnson, and Davis. (1993) —-Very readable introduction to techni¬ 
cal writing for scientists; see especially Chapter 4, Communicating 
Technical Data and Statistics. 

Phillips (1996) —Very tiioughtful, intuitive guide for users of statistical 
methods. 

Wyrick (1994) —Guide to regression analysis for undergraduates. 
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Table of Critical t-Statistics 


Level of Significance 
One-Sided Test 




.10 

.05 

.01 

Degrees 

1 

3.078 

6.315 

31.82 

of 

2 

1.886 

2.920 

6.965 

Freedom 

3 

1.638 

2.353 

4.541 


4 

1.533 

2.132 

3.747 


5 

1.476 

2.015 

3.365 


6 

1.440 

1.943 

3.143 


7 

1.415 

1.895 

2.998 


8 

1.397 

1.860 

2.896 


9 

1.383 

1.833 

2.821 


10 

1.372 

1.812 

2.764 


11 

1.363 

1.796 

2.718 


12 

1.356 

1.782 

2.681 


13 

1.350 

1.771 

2.650 


14 

1.345 

1.761 

2.624 


15 

1.341 

1.753 

2.602 


16 

1.337 

1.746 

2.583 


17 

1.333 

1.740 

2.567 


18 

1.330 

1.734 

2.552 


19 

1.328 

1.729 

2.539 


20 

1.325 

1.725 

2.528 


21 

1.323 

1.721 

2.518 


22 

1.321 

1.717 

2.508 


23 

1.319 

1.714 

2.500 


24 

1.318 

1.711 

2.492 


25 

1.316 

1.708 

2.485 


26 

1.315 

1.706 

2.479 


27 

1.314 

1.703 

2.473 


28 

1.313 

1.701 

2.467 


29 

1.311 

1.699 

2.462 


30 

1.310 

1.697 

2.457 


00 

1.282 

1.645 

2.326 



Source: This table was generated using tire Quantile, StudentTDistribu- 
tion, and NormalDistribution commands in Mathematica. 
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Introduction to 
Regression Analysis 

"You can use all the quantitative data 
you can get, but you still have to distrust 
it and use your own intelligence 
and judgment." 

ALVIN TOFFLER 


inigii 

I 

1 n the last chapter, we noted that regression analysis is the most widely 
;ij used method by which economists test hypotheses. For that reason, 
Jj it deserves a chapter of its own. This chapter won’t substitute for an 
IlisBleconometrics course, however. It is more like a course in music ap¬ 
preciation; it won’t make you a musician, but it will enable you to under¬ 
stand and respect certain types of music and perhaps even make some 
music yourself. The chapter has two modest goals: 


8 To enable you to understand published regression results well 
enough to determine whether they support a hypothesis, and 
* To provide you with a guide to performing simple regressions to test 
your own research hypotheses. 


Steps in Regression Analysis 

Regression analysis is a means of testing hypotheses while controlling 
for other factors that could influence the variable you are primarily inter¬ 
ested in. A regression analysis can be broken down into five steps, as illus¬ 
trated in Figure 11.1. We explore these steps in more detail in the sections 
that follow. 
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Figure n.i Steps in a Regression Analysis 


Step 1: State the Hypothesis 

Step 2: Test the Hypothesis, i.e.. Estimate the Relationship 
Step 3: Interpret the Test Results 

• To what extent do the coefficient estimates conform to the maintained 
hypothesis identified in Step 1 ? 

6 Are the coefficient estimates statistically significant? 
s Are they economically significant? 

• Are the coefficient estimates plausible for the real world, consistent 
with economic theory, and within the range of previous estimates? 

• How "good a fit" is the overall regression model? 

Step 4: Check for and Correct Common Problems of Regression 
Analysis 

Step 5: Evaluate the Test Results 

• On balance, to what extent do the results support the hypothesis? 


Step 1: State the Hypothesis 

The first step in a regression analysis is to state the maintained hypoth¬ 
esis. Suppose you observed the substantial increase in equity prices during 
the 1990s and wondered to what extent that increase might have con¬ 
tributed to the consumer spending that powered the economic expansion 
in tire same period. You could hypothesize that stock market wealth 
(SMW) has a positive effect on consumer spending: 

C = f[SMW], where f' >0 (1) 

Researchers call C the dependent variable, in other words, the concept 
we are trying to explain. By contrast, SMW is an explanatory or indepen¬ 
dent variable, that is, a concept we hypothesize that explains or causes the 
dependent variable. 

Traditionally, analysts have assumed that a linear relationship exists be¬ 
tween the dependent and independent (explanatory) variables. 1 This as¬ 
sumption is justified on two bases: first, nonlinear estimation is extremely 
difficult and time consuming to perform, and, second, even if a relation¬ 
ship is nonlinear, one can often approximate it with a linear relationship. 
The latter point is illustrated in Figure 11.2, which shows a plot of inflation 
versus unemployment. The function that best fits the data would be a 
curve rather than a line, but the line comes fairly close to representing the 
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Figure 11.2 Linear Approximation to a Nonlinear Function 
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function. However, the growing availability of computer programs that can 
perform nonlinear calculations has made nonlinear regressions more 
common. Nonetheless, linear regression is still the norm, and we will use 
it here. 2 

If we assume a linear relationship between stock market wealth and con¬ 
sumer spending, we can express the hypothesis in algebraic form as follows: 

C = a + b SMW (2) 

If you think about this in terms of a line, then b shows how much C 
changes as a result of a change in SMW, assuming nothing else changes at 
the same time. In other words, b shows the slope of the line. Similarly, a 
represents the vertical intercept of the line. Mathematically, it shows what 
C will be when SMW is zero; economically, it shows the effects of every¬ 
thing besides SMW on C. Because there are only two explicit variables in 
this hypothesized relationship, we call it a bivariate regression. 

So far, we have not discussed one of the major advantages of regression 
analysis: it gives us the ability to control for the effects of confounding vari¬ 
ables. Though we are primarily interested in the relationship between 
stock market wealth and consumer spending, economic theory suggests 
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other variables may affect consumer spending too, for example, income 
(Y) and perhaps other forms of wealth (OW). We can easily incorporate 
additional variables into tire model as follows: 

C = a + b 2 SMW + b 2 Y + b 3 OW (3) 

Most regression studies, like this one, are multiple (or multivariate) re¬ 
gressions, since the dependent variable is influenced by multiple ex¬ 
planatory variables. 

Once we have formally stated the hypothesis in the form of Equation 
3, it is important to identify our theoretical predictions. Note that there 
is a maintained hypothesis (or theoretical prediction) for each ex¬ 
planatory variable in the model. In our stock market wealth example, 
the main hypothesis predicts that higher stock market wealth should 
lead to higher consumer spending. Thus, bj should be a positive num¬ 
ber. The researcher should also identify the predictions for the control 
variables. For example, one might expect that the effects of income and 
other forms of wealth on consumer spending, shown by the values of b 2 
and b 3 , would be positive. It is good scientific practice never to run a re¬ 
gression without explicitly identifying the results that would confirm tire 
hypothesis. 

Step 2: Test the Hypothesis; i.e.. Estimate the Relationship 

The next step in regression analysis is to analyze the data on the de¬ 
pendent and explanatory variables for any possible relationship between 
them. We observed in Chapter 10 that even if tire maintained hypothesis 
is true, each observed data point includes an error caused by random 
chance, measurement error, or human subjectivity. Thus, the multiple re¬ 
gression model becomes the following: 

C = a + b 3 SMW + b 2 Y + b 3 OW + e. (4) 

A plot of the data for C and SMW, shown in Figure 11.3, displays a scat¬ 
ter of data points with an upward slope. 3 

Though Figure 11.3 seems to indicate a positive relationship, this con¬ 
clusion is complicated by two factors. First, we don’t know what the er¬ 
rors are for each observation. Second, we only have a sample of the data, 
rather than all the data, so it’s unlikely that the errors will cancel out. As 
a consequence, we can’t simply measure the underlying relationship. In¬ 
stead, it is necessary to estimate the relationship. This is what regression 
is all about. More specifically, we make certain assumptions about the er¬ 
ror, which allow us to make estimates of the underlying relationship. 4 Vi¬ 
olating any of the standard assumptions will require us to use different 
estimation techniques. 
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Figure 11.3 Plot of Data for Consumption and Stock Market Wealth 



Stock Market Wealth 


Regression analysis asks tire question: what line “best” fits the data (for 
example, in Figure 11.3). Recall that the equation of any line is defined by 
specific values for tire slope and intercept. Thus, the regression process 
chooses values for the slope and intercept so that the estimated relation¬ 
ship best fits the data. The various estimation techniques used by econo¬ 
mists deal with defining “best” given the specific data at hand. 

In short, the purpose of regression analysis is to generate estimates of 
the relationship between the dependent variable (e.g., C) and each of the 
explanatory variables (e.g., SMW, Y, OW). Because each of these relation¬ 
ships is estimated with all the others held constant, they are analogous to 
partial derivatives in calculus. These estimates are called estimated para¬ 
meters or estimated coefficients, and they are the primary product of the 
regression analysis. Fortunately, we no longer have to do this by hand. In¬ 
stead, a variety of computer programs are available (e.g., Excel, E-Views, 
SPSS, SAS) that perform regression analysis. 

Let’s look at an example. Figure 11.4 shows die results gained from esti¬ 
mating the relationship between real consumer spending (CONS) and two in¬ 
dependent variables: stock market wealth as measured by the Wilshire 5000 
stock market index and real personal disposable income (REAL_PDI). We will 
use this as our example as we discuss various aspects of regression analysis. 
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Figure ii.4 Sample Regression Results 

Dependent Variable: CONS 
Sample: 1990:4 2002:1 
Observations: 46 


Variable 

Coefficient 

Std. Error 

t Statistic 

Prob. 

C 

-781.48 

178.73 

-4.37 

0.0001 

W5000 

0.0096 

0.0067 

1.43 

0.1587 

REAL_DPI 

1.0419 

0.0389 

26.8 

0.0000 


R-Squared: 0.9939 

Adjusted R-Squared: 0.9936 
Durbin-Watson: 1.4391 

F: 3519.2 


Whatever statistical software you use, the results will include the coeffi¬ 
cient estimates, shown in the second column of Figure 11.4, and a variety 
of test statistics, which can be used to evaluate the text results. We will dis¬ 
cuss how to do this in the next section. 

Step 3: Interpret the Test Results 

Probably tire most important step in regression analysis, and the step 
novice researchers put the least effort into, is interpreting the results. In¬ 
terpreting regression results is not a mere mechanical exercise. To be valid, 
regression requires judgment on the researcher’s part. Recall that the pur¬ 
pose of regression is to assess the validity of your theoretical analysis. Is the 
empirical evidence consistent with your maintained hypotheses? Does it 
support the hypotheses? To answer these questions, it is necessary to eval¬ 
uate tire regression results. As you do so, it is important to remember the 
point made in Chapter 10—researchers typically test the null hypothesis, 
that there is no relationship. Thus, tire statistical hypothesis is the opposite 
of the theoretical prediction. 

Start with Your Theoretical Predictions To begin interpreting the re¬ 
sults, you need to remind yourself what evidence you hoped to see to con¬ 
firm your hypothesis. We introduced this in Step 1. Did you expect to see 
a positive or negative relationship between each explanatory variable and 
the dependent variable? Did you have any expectations about the magni¬ 
tudes of each coefficient? The simple quantity theory of money illustrates 
this well, since it makes predictions not only about whether the coefficient 
estimates are positive or negative but also about their size. 
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Figure 11 .s Sample Regression Results 



a 

b 

Theoretical Prediction 

0.0 

1.0 

Estimated Value 

0.004 

0.91 

Estimated t Statistics 

0.2 

2.4 

Estimated p Value 

0.84 

0.02 


The quantity theory was the theory of aggregate demand used by the clas¬ 
sical economists. The classical economists assumed that aggregate de¬ 
mand could have no effect on the levels of employment or output in the 
economy. As such, they hypothesized that the inflation rate (Pi) should be 
same as tire growth rate of the money supply (%AMs) and that, assuming 
no economic growth or changes in the velocity of money, the rate of in¬ 
flation would be affected by nothing else. This hypothesis can be ex¬ 
pressed in tire form of an equation: 

Pi = a + b %AM S (5) 

In short, the quantity theory predicted that a (the vertical intercept in 
a regression) would be zero and that b would be positive one. 

To What Extent Do the Coefficient Estimates Conform to Your 
Theoretical Predictions? 

Once you have clearly stated your theoretical predictions, it is time to 
examine the coefficient estimates. Consider the predicted and estimated 
values of the coefficients for tire quantity theory of money, as shown in Fig¬ 
ure 11.5. 

What are the algebraic signs of tire estimated coefficients? In Figure 
11.5, we see that the coefficient on the money growth term (i.e., b) is pos¬ 
itive (+0.91). But we cannot take that positive sign as conclusive evidence. 
Recall our discussion from Chapter 10 that the empirical results from a 
given data sample could be the result of random chance rather than of the 
hypothesized relationship. 

Regression software typically reports several test statistics to help the 
user assess this possibility. For each coefficient estimate b, the software 
computes a t statistic using the formula we introduced in Chapter 10: 

t = (x - p.)/(s/Vn), (6) 

where b takes the role of x in this equation, and p, is now the true value of 
tire coefficient. The specific hypothesis tested by the regression software is 



Quick Rule off Thumb for the Critical t 


A commonly used rule of thumb for the critical t statistic is that it is 
2.0. This is because as the sample size increases the critical t declines 
from a little above 2.0. You should confirm this by looking at the t 
tables in Appendix 10A. Thus, for almost any level of significance and 
sample size, if the estimated t is greater than 2.0 then it will be greater 
than the exact critical t, and therefore the estimated coefficient will be 
statistically different from zero. 


whether the estimated coefficient is statistically different from zero. Thus, 
using the language from Chapter 10, the null hypothesis is b = 0, while the 
theoretical prediction (or tire maintained hypothesis) is b # 0. Substitut¬ 
ing the null hypothesis into the preceding equation yields: 5 

t= '(ft-0)/(s/V5) . (7). 

t = b/ SE, (8) 

where: 

• b is the estimated coefficient between consumer spending and in¬ 
come, and 

• SE is the standard error of that estimated relationship, defined as the 
standard deviation of the independent variable divided by the square 
root of the sample size of the data set. 

If the estimated t exceeds the critical tfor the desired significance level, 
their we reject the null hypothesis and conclude that the coefficient is sta¬ 
tistically different from zero (or statistically significant). Thus, the re¬ 
ported algebraic sign is likely valid. 

By contrast, if the estimated t does not exceed the critical t, we can’t re¬ 
ject the null hypothesis. Thus, it is likely that any observed relationship is 
simply due to random chance, so the reported algebraic sign cannot be ac¬ 
cepted as valid. 

Regression software also computes a p value, which provides an alter¬ 
native approach to evaluating t statistics. The p value is the probability 
that you would obtain the estimated t score that you obtained if the null 
hypothesis is assumed to be true. If the p value is sufficiently small, or 
more precisely smaller than the level of significance, one can reject the 
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NOTES FOR NOVICE RESEARCHERS 

Assessing the Statistical Significance 
off an Estimated Coefficient 

Our hypothesis in the example shown in Figure 11.4 was that 
consumer spending during the 1990s was driven by the booming stock 
market. To assess this hypothesis, let’s examine the coefficient 
estimate for "W5000," which measures the relationship between the 
Wilshire 5000 stock market index and consumer spending. The 
estimated coefficient on W5000 is 0.0096, and the standard error is 
0.0067. Using the formula given in Equation 8, we can confirm that 
t = b/SE = 0.0096/0.0067 = 1.43. The degrees of freedom for this test 
are n - 1 = 46 — 1 = 45. For a 95 percent level of confidence and a 
one-tailed test, the critical t is 1.68. 6 Since the estimated t is less than 
the critical t (1.46 < 1.68), we conclude that the estimated coefficient 
is not statistically different from zero. Thus, our hypothesis is not 
confirmed by the data—-that is, the hypothesis fails. 



null hypothesis and conclude that the coefficient estimate is statistically 
significant. 

For the example in Figure 11.5, suppose the critical t is 2.0. Since the 
estimated ds 2.4, which is greater than the critical t, we can conclude that 
a positive relationship does appear to exist between inflation and money 
growth, which is consistent with the hypothesis of classical economic the¬ 
ory. Alternatively, using the p value approach, since the p value is 0.02, 
which is less than the level of significance, 0.05, we reject the null hypoth¬ 
esis and conclude that the estimated coefficient is statistically different 
from zero. 

Now what about the size of the coefficient estimates? Some hypotheses 
make very specific predictions about the magnitudes of the coefficients. 
For example, the quantity theory predicts that the constant term, a, will be 
zero and that & will be one. Even if a hypothesis does not make specific pre¬ 
dictions about the size of the relationships, the coefficient estimates are 
important for several reasons, which we will discuss shortly. 

In assessing the magnitudes of coefficients, we must consider several is¬ 
sues. In empirical work, it is very unlikely that the estimated coefficients 
will exactly match the predictions. Indeed, if they did, you should be very 












suspicious! Aftei all, we are examining the behavior of human subjects. 
Look back at Figure 11.5. The estimated coefficients are roughly the same 
size as tire hypothesized ones. The question is, how close is close enough? 
One way to judge this is by constructing a test to see if the estimated coef¬ 
ficient is statistically different from the theoretical prediction. To do this, 
start with tire formula for the t statistic given in Equation (6), which we 
rewrite here: 

t=(x-fi)/( s/Vn) ( 6) 

The null hypothesis is that the two coefficients are equal; thus, substi¬ 
tuting tire null hypothesis into the formula yields tire following: 

t=(b-b)/(. s/Vn) ( 9 ) 

or 

t={b-b)/ SE (10) 

where: 

• b is the estimated coefficient, 

° b is the predicted coefficient, 

6 SE is tire standard error of the estimated coefficient. 

In our example, the test statistic is (0.91 - 1.0)/2.6 = -1.6. 7 Since this 
estimated ris less than the critical tof 2.0, we can conclude that the estimated 
coefficient is not statistically different from the dreoretical coefficient. In 
odrer words, dre estimated coefficient is close enough. As such, this result, 
like the test of the algebraic sign earlier, supports the hypothesis. 

Note that the same test can be conducted on the constant term. In tiris 
example, since the predicted coefficient is zero, you can use the default 
test reported by the regression software without making any calculations. 

Statistical versus Economic Significance 

Performing a statistical analysis of the estimation results is the begin¬ 
ning rather than the end of the assessment. Even when the dreory makes 
no specific pr ediction about the size of the coefficient, to be useful in the 
real world the analyst needs to know the magnitude of the relationship. 
Consider the following equation: 

GDP = a + b INT (11) 

where INT is die interest rate. In the context of monetary policy, it matters 
a great deal whetiier b is -0.001 or -1000. 

McCloskey (1998) argues that economists often confuse statistical sig¬ 
nificance with scientific or economic significance. Suppose that in our ear- 
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lier regression, the estimated t statistic for b was greater tiran the critical t, 
so one could conclude drat the coefficient is statistically different from 
zero. But if the coefficient is numerically small (e.g., -0.001), tiien the ex¬ 
planatory variable (in diis case, die interest rate) may not matter much in 
practical terms. In other words, even tiiough it is statistically significant, it 
may not be a very important determinant of the dependent variable, GDP. 

This point is reinforced by Kennedy (2003) who observes that since nearly 
any relevant variable will have some (i.e., small) measured effect on die de¬ 
pendent variable, any variable can be made statistically significant if die sam¬ 
ple size is large enough. But what truly matters is the size of die effect. 

McCloskey also shows the converse argument: that a variable can fail the 
statistical test but still be economically important. The dieory of purchas¬ 
ing power parity (PPP) asserts diat the exchange rate between two curren¬ 
cies should be equal to die ratio of the two nations’ aggregate price levels: 

ER - Pus/Pjapan (12) 

where: 

° ER is die exchange rate (U.S. dollars/Japanese yen). 

• P m is the consumer price index in the United States. 

* Pjapan is die consumer price index in Japan. j 

Suppose a test of the theory of PPP yields a coefficient of 0.9, which is 
shown to be statistically different from die theoretical hypothesis of 1.0 
(peihaps because of a large sample). Should one conclude that die theory 
fails? Careful consideration of the theory could lead to the conclusion that 
real-world imperfections such as transportation costs, which the theory ig- 
noies, prevented die coefficient from being closer to the theoretical one. 

In short, even diough die estimated coefficient is statistically different 
fx om die null hypodiesis, it may be economically close enough, meaning 
diat the behavior posited by die dieory is in fact occurring. 

Are the Coefficient Estimates Plausible? 

When you examine a coefficient estimate, you should ask several questions. 
Suppose that you have estimated die demand for new cars as a function of the 
price of new cars, as well as a number of otiier explanatory variables: 

Q d = a + b 2 P + b 2 X (i 3 ) 

where: 

6 Q d is the quantity of new cars demanded, 

6 P is the price of new cars, and 
• X is the other explanatory variables. 
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NOTES FOR NOVICE RESEARCHERS 

The Largest Estimated Coefficient Is Wot Always 
the Most Important! 

Researchers sometimes think that the variable that has the largest 
estimated coefficient is the most important. After all, it has the largest 
impact on the dependent variable. Or does it? Suppose that one 
variable is defined in millions while another is defined in billions. If the 
coefficient on the latter is smaller than the coefficient on the former, it 
may still have a larger impact on the dependent variable than the 
former if it is less than one thousand times smaller. Alternatively, 
suppose that one variable is measured in billions of dollars but the 
other is measured in percentage points (e.g., interest rates). It’s not 
obvious, then, that the larger coefficient has the larger effect, since a 
one-unit change in billions of dollars is not the same as a one-point 
change in the interest rate. Another common error is to think that the 
variable that has the largest t-statistic is the most important variable. 
As we noted earlier, a small coefficient with a large t statistic has only 
a small effect, so don't make this mistake. 


What are the units of tire parameter estimates? If you don’t know what 
the units are it is difficult to interpret the meaning of the parameter esti¬ 
mates. Suppose price is measured in thousands of dollars, and the quan¬ 
tity demanded is measured in diousands of cars. Then, from Equation 13, 
bj is AQ d /AP = thousands of cars/thousands of dollars = cars per dollar. 
You should always use meaningful units to report regression results! 

Given die units, are the parameter estimates plausible? Are they consis¬ 
tent with common sense? Suppose bj is estimated to be —5,226,371,462. 
This implies that a one-dollar decrease in price causes an increase in new 
car purchases of roughly 5.2 billion cars. Given that total U.S. car produc¬ 
tion in 1999 was approximately 5 billion cars, this coefficient estimate is 
suspect. It is not reasonable to think that demand for cars would double if 
the price fell by merely one dollar. 

Getting implausible parameter estimates suggests an econometric prob¬ 
lem of one type or another. For example, tire model might omit a key vari¬ 
able or the underlying relationship could be nonlinear. We will turn to 
these issues shortly, in Step 4. 

Ill 
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NOTES FOR NOVICE RESEARCHERS 

Assessing the Plausibility of an Estimated Coefficient 

The example in Figure 11.4 included income as a determinant of 
consumer spending. Therefore, the estimated coefficient on income is 
the marginal propensity to consume. The coefficient estimate was 
1.04, which, strictly speaking, is outside the range of theory. But is it 
implausible? There is a great deal of evidence to suggest that 
consumers went on a spending spree during the 1990s. The estimate 
of 1.04 is not significantly different from the upper limit of the 
theoretical range, so one could conclude that it is plausible. By 
contrast, if the estimated coefficient were five or ten times larger or 
smaller, it would not be plausible. 


In more general terms, one way to evaluate tire plausibility of parame¬ 
ter estimates is to convert them into elasticities. 8 For a linear demand 
equation, tire estimated coefficients are AQ d /AP, AQ d /AY, and so on. 
The price elasticity of demand is defined as %AQ d /%AP, or AQ d /AP * 
P/Q d . To obtain an average value for the elasticity, you can multiply the 
estimated coefficient by tire ratio of the mean price in tire data sample to 
tire mean quantity demanded. Other demand or supply elasticities can be 
computed in a similar- way. For example, to estimate tire average income 
elasticity of demand, you can multiply the estimated coefficient on the in¬ 
come term by the ratio of the mean income in the data to tire meair quan¬ 
tity demanded. 

Another question to ask when assessing the values of estimated coeffi¬ 
cients is whether or not they are consistent with economic theory. In 
other words, does theory imply anything about the size of a coefficient? 
For example, in estimating a consumption function, the parameter on in¬ 
come is the marginal propensity to consume, which theory says should be 
a positive fraction (i.e., 0 < b < 1). Is your estimated coefficient a positive 
fraction? 

Additionally, you should ask how the parameter estimates compare with 
previous estimates by other scholars. Are they near that range? If they are 
too far- off, that might indicate an econometric problem, which should be 
investigated. If your coefficient estimates “pass” all of these tests, you can 
treat them as valid. 
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You should note diat it is not uncommon to have somewhat mixed re¬ 
sults in which some parameter estimates are consistent with your a priori 
expectations (i.e., close enough), while others are not. These are imper¬ 
fect tests, after all, and the data are rarely ideal. In these cases, the analyst 
needs to exercise judgment. Do die explanatory variables involving die spe¬ 
cific hypothesis support die theory, while control variables may not? Are 
most of the variables consistent with die dieory? 

How "Good a Fit" Is the Regression Model? 

We noted earlier that regression analysis selects parameter estimates in 
order to create a regression line that “best” fits die data. Most regression 
software computes test statistics to help the user determine how well this 
goal is achieved. The two most common of diese are the R 2 (or adjusted 
R 2 ) and the F statistic. 

The R 2 statistic is an estimate of the proportion of the variation in the 
dependent variable that is explained by the model. The higher the pro¬ 
portion, die better the model explains the variation in the dependent 
variable. Note tiiat this is not identical with “proring” your hypothesis. We 
will explain this point in more detail shordy. Because of the way that R 2 is 
computed, whenever you add more explanatory variables, the R 2 will in¬ 
crease. In fact, the R 2 will increase even if the variable you add does not 
significantly affect the dependent variable. But this is cheating! The ad¬ 
justed R 2 , usually called R 2 corrects for this. We won’t distinguish between 
the two from here on, but if you have a choice you should rely on the 
adjusted R 2 . 

The F statistic tests the hypothesis diat all estimated coefficients are 
jointly (or collectively) equal to zero; that is, it tests die hypothesis that b 1 
= b 2 — ... — b n = 0, for all n coefficients. This works much like the t test 
described previously. If the estimated F statistic is greater than the critical 
F obtained from statistical tables, then we can reject the null hypothesis 
and conclude diat all coefficients are statistically significant. Thus, the 
model as a whole has some validity. Note that the F statistic examines the 
same question as the R 2 does: namely, how well the model as a whole w'orks 
at explaining the data. 

Statistical significance is a good thing, but it is often overrated. For ex¬ 
ample, it is common for notice researchers to believe that the single most 
important result of a regression analysis is a “high” R 2 . This is, frankly, not 
true. In die context of testing hypodieses, R 2 can proride useful diagnos¬ 
tic information, but only indirectiy. (By contrast, in the context of fore¬ 
casting, R 2 is more direcdy relevant.) A low R 2 may be an indication that 
one or more explanatory variables are missing from die model; thus, in¬ 




cluding diem could improve the accuracy of the results. Kennedy (2003) 
points out diat a high R 2 may be caused by the peculiarities of a specific 
data sample—though it fits the current sample well, it may fit other sam¬ 
ples poorly. Thus, internal validity may be achieved at die cost of external 
validity. In short, statistical significance is neither necessary nor sufficient 
to confirm one’s hypothesis. Consider another example. If die estimated 
coefficient has die wrong sign but is statistically significant, die hypothesis 
fails. This issue will be discussed more in Step 5, later in this chapter. 

One final note: R 2 for cross-section data is generally less titan R 2 for 
time-series data. Econometricians typically consider a time-series regres¬ 
sion to be “good” if it results in an R 2 of 0.8 or higher. By contrast, a cross- 
section regression is considered “good” if it has an R 2 of only half that: 0.4 
or above. 

Step 4: Check For and. If Necessary, Correct Common 
Problems of Regression Analysis 

We began this chapter by framing die question that is the focus of the 
chapter: how does one use regression analysis to empirically test a hy¬ 
pothesis? The answer is that regression provides estimates of coefficients 
diat show the signs and magnitudes of a hypothesized relationship, when 
controlling for other relevant factors. 

However, the validity of the ordinary-least-squares (OLS) regression 
mediodology, which we discussed earlier in this chapter, depends on a 
number of technical assumptions holding true. In practice, it is not un¬ 
common for one or more of these assumptions to be false. When that hap¬ 
pens, the resulting parameter estimates are not ideal, which means that 
the process may not adequately test die hypothesis. 9 So, it is always prudent 
to investigate this possibility. Indeed, you should do so before you invest a 
lot of time and effort in interpreting the initial regression results you ob¬ 
tained. Typically, econometric software packages provide summary statis¬ 
tics to enable the researcher to do this. 

Let’s consider five common problems of regression analysis: autocorre¬ 
lation, heteroskedasticity, specification error, multicollinearity, and simul¬ 
taneous equations bias. We trill define each problem, suggest how to 
diagnose it, and offer possible treatments. Note that this chapter, as a brief 
introduction to regression, will not discuss regression problems in great 
depth. For a more complete treatment, see any econometrics text, such as 
those listed in Suggestions for Further Reading at the end of the chapter. 

Problem 1: Autocorrelation The ordinary-least-squares regression model 
is based on the premise that the underlying relationship between the 
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dependent and explanatory variables is linear (or additive). In other 
words, the regression equation can be expressed as follows: 

Y = b 0 + b,X x + b 2 X 2 + ...+ b n X n + e (14) 

where Y is the dependent variable, the X’s are one or more explanatory 
variables, and e is the error term. The model further assumes that the er¬ 
ror in each observation of the data sample is independent of tire error in all 
the others. For example, what this means is that if one error is positive tire 
next error is no more likely to be positive than negative. 

Suppose this were not the case. If a positive error was more likely to be 
followed by another positive error, or a negative error was more likely to 
be followed by another negative error, then we say that the errors are auto 
(or serially) correlated. Autocorrelation means that the errors are depen¬ 
dent on or correlated with each other. (Serial correlation refers to auto¬ 
correlation in time-series data; it means that the error in one time period 
is dependent on or correlated with the error in the next period.) 

The consequence of autocorrelation is that the standard error measured 
by the regression software will be biased downward, so that the reported t 
scores will be artificially inflated. This means that tests of significance may 
indicate that the explanatory variables are statistically different from zero 
when they are not. 

Autocorrelation suggests that something systematic in the determina¬ 
tion of the dependent variable has been omitted from the regression equa¬ 
tion. One possibility is that a relevant explanatory variable has been left 
out. This would be an example of a specification error, that is, using an in¬ 
correct model. In this case, the solution would be to add the missing vari¬ 
able to the estimation. 

True autocorrelation, however, occurs when the problem resides in the 
errors themselves rather than being a symptom of something else, for ex¬ 
ample, some sort of inertia that allows a disturbance to spread either 
across time, or space. One example of this might be a lagged adjustment of 
investment to a change in interest rates. Another could be when a down¬ 
turn in the economy spills over from one state to a neighboring one. What¬ 
ever the cause, first-order autocorrelation is modeled as follows: 

e t = pe t -1 + u t (15) 

where p is a fraction that indicates the extent to which the error in one ob¬ 
servation spills over to affect the error in the next observation, and u t is a 
random component of the error. 

The easiest way to check for first-order autocorrelation is to use the 
Durbin-Watson statistic (DW), which is computed by most econometric 
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NOTES FOR NOVICE RESEARCHERS 

Quick Rule off Thumb for the Durbin-Watson 

In an earlier note, we observed that a quick mle of thumb is to use 
2.0 as the critical t in hypothesis testing. Here is a rule of thumb for 
assessing the possibility of autocorrelation: The closer the Durbin- 
Watson is to 2.0, the less likely it is that autocorrelation is a problem. 

software packages by estimating p in Equation 15. When p = 0 (meaning 
no first-order autocorrelation), DW = 2.0. This is the null hypothesis. The 
greater the difference between DW and 2.0, tire more likely it is that auto¬ 
correlation exists. Specifically, there are two critical DW statistics: DWu 
and DW l (for upper and lower, respectively). If the estimated DW is 
greater than DW U> then one can conclude that DW is not statistically dif¬ 
ferent from 2.0, and thus, one cannot reject the null hypothesis of no au¬ 
tocorrelation. This is the desired result. If DW is less than DW L the null 
hypothesis is rejected in favor of the alternative of first-order autocorrela¬ 
tion. If DW falls between DW L and DW Uj it falls in the ambiguous range 
and requires the researcher to make a judgment about how to proceed. 

Most econometric software gives users the ability to correct for first- 
order autocorrelation by estimating the relationship and adding it to the 
regression model. As a result, when the model is estimated the t scores 
will be correctly reported and the coefficient estimates correctly as¬ 
sessed. Consult your software’s documentation to determine how to per¬ 
form this fix. 

Problem 2: Heteroskedasticity The ordinary-least-squares regression 
model also assumes that though the errors are independent from one an¬ 
other, they have constant variance. This implies that large values of a de¬ 
pendent variable are no more likely to have large errors than are small 
values of the dependent value. Though this is a statistical requirement, 
there may well be economic reasons why it fails to occur. Kennedy (2003) 
points out that when you are considering the relationship between con¬ 
sumer spending and income, it would not be unlikely for you to discover 
that the “errors” in spending increase as income does. 

The consequences of heteroskedasticity are similar to those of autocor¬ 
relation: the standard errors on the estimates of the coefficients are bi¬ 
ased. As a result, one cannot draw correct inferences about the statistical 
significance of the parameter estimates. 10 
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Assessing the Possibility of Autocorrelation 

The regression results presented in Figure 11.4 give the estimated 
Durbin-Watson statistic as 1.43. For a sample size of 45 and two 
explanatory variables the lower limit for the critical DW is 1.43, while 
the upper limit is 1.62. Since the estimated Durbin-Watson statistic is 
near the bottom of the ambiguous range, this strongly suggests that 
autocorrelation is a problem. To address the problem, we reestimate 
the model using a first-order autoregressive term. The results are as 
follows: 

Dependent Variable: CONS 
Sample: 1990:4 2002:1 
Included observations: 46 


Variable 

Coefficient 

Std. Error 

t Statistic 

Prob. 

C 

-321. 68 

288.51 

-1.156 

0.2711 

W5000 

0.025 

0.0105 

2.391 

0.0215 

REAL_DPI 

0.942 

0.0617 

15.257 

0.0000 

AR(1) 

0.473 

0.1525 

3.100 

0.0035 

R-Squared: 

0.9943 





Adjusted R-Squared: 0.9939 

Durbin-Watson: 2.2277 

F: 2394.7 

Note that the Wilshire 5000 is now statistically significant, and the 
marginal propensity to consume out of income (.942) is now less 1.00, 
which is consistent with theory. 


The easiest way to detect heteroskedasticity is to examine a plot of tire er¬ 
rors against each explanatory variable. If tire errors are roughly tire same as 
the explanatory variable increases, there is probably iro heteroskedasticity. 
If tire errors appear to increase or decrease in magnitude as the explanatory 
variable increases, you may have a problem. You may then want to go on to 
a more sophisticated test, such as the Goldfeld-Quandt test or the White test. 

Most econometric software allows the user to correct for heteroskedas¬ 
ticity. As a result, when the model is estimated the t scores will be correcdy 
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reported and the coefficient estimates correctly assessed. Consult your 
software’s documentation to determine how to perform this fix. 

Problem 3: Simultaneous Equations Bias The ordinary-least-squares 
regression model assumes that all the explanatory variables are truly inde¬ 
pendent or exogenous. That means that die explanatory variables are de¬ 
termined outside die model being estimated. Consider, for example, die 
demand for pizza on die part of a plumber named Susan. We generally as¬ 
sume that an individual’s demand for pizza depends on his or her income. 
Now, Susan’s income is determined by the wage rate she earns as a 
plumber multiplied by die number of hours she works. As a result, Susan’s 
income is independent of her purchases of pizza. Or to put it a different 
way, her income is exogenous, and we could estimate her demand for pizza 
as a function of her income without difficulty. 

But suppose that, instead of being a plumber, Susan manages a pizza 
parlor and she works on commission. Thus, in contrast to die previous 
case, her income would be affected by die number of pizzas she buys (as 
well as the number that she sells). In other words, her income is endoge¬ 
nous, or determined inside the model being studied. In this case, if we es- 
. timated her demand for pizza, the coefficient on income would be biased. 

When we estimate Susan’s demand for pizza, we want to determine die 
effects that income has on her pizza purchases. The problem here is that 
Susan’s income is also a function of her spending on pizza, so that an in¬ 
crease in her income leads to an increase in her spending on pizza, which 
leads to an increase in her income and another increase in her spending. 
As a result, the coefficient showing the effect’of the effect of income on de¬ 
mand will be overestimated. 

Now this example is an extreme one by design, but the problem is a 
general one. When an explanatory variable is endogenous, the resulting 
parameter estimates will be biased. In fact, this is called simultaneous 
equations bias. 

The solution is to take into account tire relationship between tire de¬ 
pendent variable (e.g., Susan’s pizza purchases) and tire explanatory vari¬ 
able (e.g., Susan’s income) by using a more sophisticated estimation 
technique than ordinary least squares. Such a technique is beyond the 
scope of this book, but the problem of possible simultaneous equations 
bias is nonetheless something all researchers should be aware of. 

Problem 4: Specification Error To obtain valid estimates of regression 
coefficients, we must specif)' tire model being tested correctly. There are 
two aspects to model specification. First, tire model must include tire cor¬ 
rect explanatory variables; the reason for this should be self-explanatory. 
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Second, the hypothesized equation must have die correct functional form. 
In odier words, if you are using ordinary-least-squares regression, the re¬ 
lationship being estimated must be linear, or approximately linear. 

This means diat one can commit diree lands of specification errors. 
First, tire model may be missing one or more relevant explanatory vari¬ 
ables. Second, the model may include one or more irrelevant explanatory 
variables. Third, the underlying relationship may not be a linear one. 

If you leave out a relevant explanatory variable, your estimated coeffi¬ 
cients will be biased, and the estimated t statistics will be artificially re¬ 
duced. This will make it appear that your coefficients are less statistically 
significant than they are. The effects of omitted variables show up either 
through tire constant term or in tire error term. Thus, a large and statisti¬ 
cally significant constant term suggests that tire model is incompletely 
specified. Similarly, autocorrelation may indicate an omitted variable. This 
is why theorizing before estimation is so important. It also suggests a solu¬ 
tion: returning to the theoretical drawing board and considering what 
other explanatory variables may be important to the model. Another op¬ 
tion is to review tire literature for more complete models. 

On the other hand, if you include an irrelevant variable, the resulting 
estimated coefficients will be unbiased. However, they will have inflated 
standard errors and reduced i statistics, which again makes it harder to 
draw statistical inferences. 

Finally, the solution to nonlinear models is also theory. Does theory sug¬ 
gest a linear or nonlinear relationship? If tire latter, there are several ways 
to proceed. One way is to use OLS and hope tire relationship is approxi¬ 
mately linear. Another way is to transform the data into a linear form, as 
discussed in Appendix 11A. The third alternative is to use a nonlinear es¬ 
timation technique. Such a technique is beyond the scope of this book. 

Problem 5: Multieollinearity Multicolliirearity occurs when two or more 
explanatory variables are highly correlated. Kennedy (2003) defines mul- 
ticolliirearity more formally as occurring if there exists an approximate lin¬ 
ear combination of two or more variables. An easy way to check for 
multieollinearity is to look at the correlation matrix for the explanatory 
variables. A standard rale of thumb is that if the correlation between two 
explanatory variables is more than 0.8, multieollinearity is likely to be a 
problem. 

The consequence of multieollinearity is that the regression software 
doesn’t know how to allocate tire joint effects of the two collinear variables 
on tire dependent variable. So, as Kennedy (2003) points out, tire software 
throws out tire joint effects and only gives “credit” to tire unique effects of 
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NOTES FOR NOVICE RESEARCHERS 

Symptoms of Multieollinearity 

One sign of multieollinearity is that small changes in the data 
sample cause significant changes in the parameter estimates. Another 
is a regression that yields high R-squared but low t statistics, so few 
coefficients are significant. 


each explanatory variable. As a result, tire parameter estimates are ineffi¬ 
cient with high variances—as if tire independent variable lacked sufficient 
variation. The parameter estimates are still unbiased, but tire variances are 
inflated. Thus, the estimates are imprecise, which leads to low t statistics, 
making hypothesis testing difficult. 

The solution to multieollinearity is to provide additional information, 
either in tire form of a priori conditions (based on economic theory) or in 
the form of additional data. Another solution is to use an alternative form 
of an explanatory variable, one that hopefully is not so correlated with tire 
other explanatory variable. For example, one could create a proportion, 
such as investment spending as a share of GDP, instead of a level, such as 
investment spending by itself. 

Note that you may solve the multieollinearity problem by dropping a vari¬ 
able drat is highly correlated with another in tire model. However, this simul¬ 
taneously creates a specification error, so you should avoid this “solution.” 11 


NOTES FOR NOVICE RESEARCHERS? 

A Closing Comment on Regression Problems 

Sometimes novice researchers make too big a deal out of 
econometrics problems such as these. They think that if you can’t 
correct the problem, then no valuable information can be learned from 
the regression. However, the correct interpretation is a lot more subtle 
than this. It is not that these econometric problems render the 
empirical results invalid; it just means that they are possibly less valid 
than they would be absent the problem. Thus, the researcher must 
exercise care and judgment in evaluating conclusions. 
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Step 5: Evaluate the Test Results 

The last step in the regression process is to evaluate the results. This is re¬ 
ally the point of regression analysis. You have developed a hypodiesis. You wish 
to assess die evidence in die real world to see if it supports die hypodiesis or 
not. You have collected an appropriate and adequate data set, and deduced 
die predictions tiiat would confirm your hypodiesis. You run your regression 
to estimate the model. You have corrected all die regression problems you 
can, as desciibed in Step 4. You then interpret die results as described in Step 
3, diat is, you ask “What do die results say?” Now you must answer the more 
difficult question: what do die results mean with respect to your hypothesis? 

More often than not, die results will be mixed. Some estimated coeffi¬ 
cients will match your predictions, and some will not. Some estimated co¬ 
efficients will be statistically significant, and some will not. Remember tiiat 
if an estimated coefficient is not statistically significant, dien you should 
treat the coefficient as zero. Some trill be economically significant, and 
some will not. Some will be plausible, and some will not. The overall model 
may fit die data well or it may not. All misses are not equal, however. Did 
die key coefficients, tiiose that directiy assess your hypothesis, match? If so, 
dien you have at least some evidence diat supports the hypodiesis. 

Ultimately, interpreting regression results is more of an art than a sci¬ 
ence; it requires judgment, not merely computation. 

zm 



Summary 


SUMMARY 

Consider die hypodiesis diatY is determined by some causes X h X 2 , and 
X 3 . This hypodiesis can be expressed as an equation: Y = a + bjX j + b 2 X 2 
+ b 3 X 3 . The coefficient b 3 shows die relationship between Xj and Y, since 
if Xj changes by one unit, Y will change by b 2 units. Similarly, die coeffi¬ 
cient b 2 shows die relationship between X 2 and Y, and die coefficient b 3 
shows die relationship between X 3 and Y. 

The coefficient a shows everything besides X 1; X 2 , and X 3 diat affects Y. 

In order to test the hypothesis, we need to analyze data to estimate die val¬ 
ues of b h b 2 , b 3 , and a. This is the role of regression analysis. In other words, 
when conducting a regression analysis, we are given X and Y, and die ob¬ 
jective is to find values for die b’s and a in the equation. 12 

• Regression analysis can be broken down into five steps. Note that we will 
present diem here in a slighdy different order than we did in die text. 

• The first step is to state the hypothesis of your research. What is the 
dependent variable? What are die explanatory variables? What are the 
expected relationships between each explanatory variable and die de¬ 
pendent variable? In other words, what coefficients will confirm die 
hypothesis? 

° The next step is to test your hypothesis by performing the regression 
using appropriate software. 

° Once you obtain results, you should check to see if the regression 
suffers from common problems, including autocorrelation, het- 
eroskedasticity, simultaneous equations bias, specification error, or 
multicollinearity. Note that at least several of these problems can be 
addressed before the estimation—for example, by carefully think¬ 
ing about model specification or by examining the correlation 
between explanatory variables. To the extent possible, fix any prob¬ 
lems you discover. 

• Once you have die most correct results you can obtain, you are in a 
position to interpret the results. In other words, you need to answer 
the question: What do the results say? Are the estimated coefficients 
statistically significant? Are tiiey economically significant? Are they 
plausible in die light of theory and previous studies? How good a fit 
is the overall regression model? Finally, how closely do die estimated 
coefficients match die expected coefficients? 

• The last step in a regression analysis is to evaluate the test results in or¬ 
der to draw conclusions about your hypothesis. In other words, what 
do the regression results mean for your hypodiesis? On balance do 
they confirm or reject the hypodiesis? 

<225 
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NOTES 

1. Note that, strictly speaking, we mean linear in parameters rather than 
linear in variables. See, for example, Appendix 11A, which explains 
how some equations that have nonlinear variables can be transformed 
so as to be amenable to linear regression. 

2. The issue of linear versus nonlinear regression is part of the more gen¬ 
eral issue of model specification. Model specification involves select¬ 
ing a regression model that accurately characterizes the underlying 
relationship between the dependent and explanatory variables. There 
are three issues here: including all relevant explanatory variables, ex¬ 
cluding any irrelevant variables, and using the correct functional form 
(i.e., linear or nonlinear) for the equation. . 

3. This is equivalent to the bivariate model C = a + b SMW + e. Alter¬ 
natively, we could have plotted C and any other explanatory variable 
in Equation 4. 

4. Conventional assumptions include: (1) that the errors are normally 
and independently distributed with mean zero and (2) that the errors 
have constant variance. 

5. You should confirm this result, that t = b/SUL, with actual regression 
results. 

6. 1.68 is the critical t for 40 degrees of freedom; for infinite degrees of 
freedom, the critical t is 1.64. 

7. The standard error is the same one in Figure 11.5. It is implied by the 
data in Figure 11.5 and is usually reported by the statistical software. 

8. By estimating a demand or supply function in log-linear form, the re¬ 
sulting coefficient estimates are the respective elasticities. For more on 
the use of log linear functional forms, see Appendix 11A. 

9. Econometricians use the term best estimate for this ideal. The precise 
meaning of best goes beyond the scope of this text. For an explanation, 
consult any econometrics text. 

10. More formally speaking, both autocorrelation and heterskedasticity 
cause the coefficient estimates to be unbiased but inefficient. 

11. This assumes that tire collinear variable truly belongs in the equation 
according to economic theory. 

12. By contrast, the process of simulation or forecasting takes values for a 
and the b’s as given. Then, given values for X, one can predict corre¬ 
sponding values for Y. 


Exercises 


SUGGESTIONS FOR FURTHER READING 

Gujarati (2003)— Very pragmatic approach to econometrics. Less a study of 
tire field of econometrics than of how to use econometrics in research. 

Kennedy (2003)— Excellent guide to econometrics; more intuitive than 
most texts. This is one of the books graduate students have bought for 
years to supplement their assigned econometrics texts. 

McLagan (1973)— Readable introduction to regression analysis for nonex¬ 
perts published in the Federal Reserve Bank of Philadelphia journal 
Business Economics. 

Studenmund (2001)— Very intuitive undergraduate econometrics text. 

WyricMl994) —Nicely done introduction to econometrics for undergradu¬ 
ates, especially in its discussion of dummy variables and qualitative de¬ 
pendent variables. 

EXERCISES 

1. Suppose you plan to estimate a marginal cost function where MC — 
f (w, r, Q). What would you expect the signs on the estimated coefficients 
to be if the cost function is valid? 

2. Consider the Cobb-Douglas production function for the U.S. economy 
over the time period 1980 to 2000: Q = TL a K b . If we take tire natural 
logarithms of both sides of the equation, we obtain a log-linear pro¬ 
duction function: InQ = InT + a InL + b InK. What do you expect tire 
signs to be on the estimated coefficients of this production function 
and why? 

3. Collect thirty years of annual data for U.S. GDP, employment, and 
capital stock. Take the natural logarithms of each variable and esti¬ 
mate the log-linear production function. Interpret the results of the 
regression. Are the estimated coefficients statistically significant or 
not? Are the coefficients economically significant or not? Are the co¬ 
efficients plausible? To what extent do the estimated coefficients 
meet your expectations? 

4. How good a fit is the overall regression model? 

5. What does the Durbin-Watson statistic suggest regarding the possibility 
of autocorrelation? 

6. Examine the correlation matrix for the three explanatory vaiiables. Is 
there any indication of multicollinearity? 

7. Is there any evidence of other common regression problems? 
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Data Transformations 
for Regression Analysis 

Sometimes it is necessary to transform your data so you can either put 
tire data in the form required for your model or to solve an econometric 
problem. Natural logarithms are particularly useful in this respect. Con¬ 
sider tire Cobb-Douglas production function, which is tire most widely 
used version of an aggregate production function: 

Q=TL a K b (16) 

where Q is output, T is an index of technology, L is the labor supply, K is 
the capital stock, and a and b are parameters related to the marginal prod¬ 
ucts of labor and capital. This function is nonlinear, so we can’t estimate it 
directly using ordinary least squares. If, however, we take tire natural loga¬ 
rithms of both sides of the equation, we obtain a log-linear production 
function: InQ = InT + a InL + b InK Notice that this is like any other lin¬ 
ear function: Y = a + 6 J X 1 + b 2 X 2 . Therefore, estimating by using the 
ordinary-least-squares model is straightforward. Notice also that, with this 
form, the coefficients are estimates of the elasticities of output with respect 
to labor and capital. Similarly, if you estimate a log-linear demand or sup¬ 
ply function, InQd = InC + alnP + b InY, the coefficients are the estimated 
elasticities. For example, in this case, a is tire price elasticity of demand, 
and b is the income elasticity of demand. 

Suppose theory suggests that your model is nonlinear, but that taking log¬ 
arithms will not help. The classic example is a wage function in which wages 
are hypothesized to increase as a function of age, but at a decreasing rate: 

WAGE = a + b AGE + cAGE 2 (17) 


where b > 0 and c < 0. To estimate tire function, you first need to create a 
squared term for age, using a spreadsheet or your statistical software: 
AGE_SQUARED = AGE 51 AGE. Then you can run the regression using 
WAGE, AGE, and AGE_SQUARED. 

It is common to use a time trend in regression analysis—for example, to 
serve as a proxy for technology. The assumption is that technolog)' im¬ 
proves by a constant amount each year. To create a time mend, you simply 
create a variable that increases over the sample and thus has values of 1, 2, 
3, and so on. 
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Estimation Using Qualitative 
Variables 

Up to this point, we have been assuming that all variables we are using 
for empirical testing are quantitative in nature, that is, they are variables 
with a wide and more or less continuous range of values, such as incomes, 
prices, ages, and so on. For many applications, however, you may need to 
employ qualitative variables. Qualitative variables come in two varieties: 
dummy variables and limited numerical variables. Dummy variables are 
those defined by nonnumerical values, such as male versus female, college 
graduate or not, and racial or ethnic background—for example, African 
American, white, Hispanic, Asian American, Native American. Limited nu¬ 
merical variables are those whose values take only a limited range of inte¬ 
ger values, such as tire number of children in a family. Qualitative variables 
can be used either as independent or dependent variables. 


Qualitative Independent Variables 

When the qualitative variables are on tire righthand side of tire equa¬ 
tion, that is, when they are independent or explanatory variables, tire so¬ 
lution is to use a dummy variable approach. The idea behind dummy 
variables is that there is a discrete effect as' a result of gender, or education, 
or race or whatever tire qualitative variable is. 

The dummy variable is constructed so as to have a value of one or zero, 
depending on the case chosen. For example, you might use a dummy vari¬ 
able for gender in examining tire wage gap. By letting female have tire 
value of one and male have the value of zero, if tire estimated coefficient 
showed a negative sign it would indicate how much lower female wages are 
than male, when holding all other factors constant. 

Dummy variables can also be used for multiple categories, such as 
ranges of education: high school dropout, high school graduate, person 
with some college, college graduate. The technique is the same. The 
dummy variable for tire dropout category would have a value of one for 
those individuals who did not complete high school and a value of zero for 
everyone else. The dummy variable for tire graduate category would have 
a value of one for those individuals whose highest level of education was 
high school and a value of zero for everyone else. It is important that one 
dummy variable be omitted from tire estimation; otherwise, tire regression 
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mil exhibit perfect collinearity between die dummy variables (and in all 
likelihood die estimation will blow up). (That’s why we only use one 
dummy for the bivariate choices like male/female.) 

Dummy variables are also used in time-series studies of policy analysis, 
where die two conditions are “before die policy was implemented” and “af¬ 
ter Anodier example of dummy variables is the use of interaction terms. 
Sometimes explanatory variables affect die dependent variable in a more 
subde way than we typically assume. For example, we know that wages are 
influenced by bodi work experience and gender. Suppose we are inter¬ 
ested in seeing if die returns to experience are affected by gender. We 
could define work experience (EXP) as number of years working. Then we 
could define a dummy variable for gender, MALE, where Male has a value 
of one, and Female has a value of zero. Finally, we could define an inter¬ 
action variable MALEJEXP = MALE * EXP. This yields the following 
model to be estimated: 

WAGE = a + b EXP + cMALE + d MALE_EXP (18) 

Thus, if you are male, your returns to experience are b + d, while if you 
are female, your returns to experience are b. If the estimated value of d is 
statistically greater than zero, then we conclude diat males are rewarded 
significandy more dian females for equivalent increases in experience. 

The use of dummy variables causes no problems with ordinary least 
squaies 1 egression. You estimate your model in die same way as you would 
using quantitative variables. 

Qualitative or Limited Dependent Variables 

Foi some research, it makes sense to use a dummy variable on die left- 
hand side of the equation—that is, a dummy or limited dependent vari¬ 
able. Suppose you wished to model the determinants of housing choice by 
college students: on-campus or off-campus. The choices are binary: either 
they live on-campus or off. Recalling that dummy variables have a value of 
zei o or one, estimating such an equation using ordinary least squares runs 
into several problems. If the estimated equation predicts a value that is in 
between zero and one, what does diat mean? Similarly, what does it mean 
if it predicts a value less than zero or greater tiian one? 

The solution is to use an estimation procedure diat forces die depen¬ 
dent value to lie between zero and one. There are two such commonly 
used metiiods, the logit model and die probit model. For details on how 
to use diese methods, consult any econometrics text. 



"Science is an instance of writing with 
intent, the intent to persuade other 
scientists, such as economic scientists." 


D. McCLOSKEY 


0 \ research project is not complete until its findings are com- 

/ IIL. firi' municated to a larger audience, namely, the scientific 
^4 community. This is how knowledge progresses. This com- 
Ellisiil SSafemunication is performed in three steps. First, the re¬ 
searcher drafts a mitten report summarizing die results of his or her 
research. Second, die researcher presents the report orally at professional 
workshops or conferences. Finally, die researcher submits the research re¬ 
port for publication in die form of an article or book. 

Note diat each of diese diree steps may be done more than once. The 
report goes dirough multiple drafts as die audior attempts to perfect his 
or her argument. The primary purpose of giving a research presentation 
is to gain feedback from experts in die field. After the presentation die re¬ 
searcher is likely to rerise die research report again, based on die ques¬ 
tions and comments he or she received. The researcher may tiien give the 
revised report in anodier research presentation. It is common for papers 
submitted for publication to be rejected. The audior does not give up, 
however; rather, he or she looks for a new journal or new publisher until 
publication is assured. 

This chapter explains die nut and bolts of diis research communication 
process. The first half spells out how to mite a research report, and die 
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NOTES FOR NOVICE RESEARCHERS 


Venues for Undergraduate Research 

A growing number of conferences offer opportunities for 
undergraduate economics students to present their research. 
Examples of these include: 

• the annual meetings of the Virginia Social Science Association 

(http://www.vssa.net) 

e the Eastern Economic Association 

(http://www.iona.edu/eea/), and 

• the National Conference on Undergraduate Research 

(http://www.ncur.org/). 

Additionally, several journals exist that publish undergraduate 
economics research. Among these are the following: 

0 Issues in Political Economy (http://www.elon.edu/ipe) 

0 University Avenue Undergraduate Journal of Economics 

(http://www.econ.ilstu.edu/UAUJE/default.html) 

° The Student Economic Review (http://econserv2.bess 
.tcd.ie/SER/) 

• SOAS Economic Digest (http://www.soas.ac.uk/SED/ 
home.html) 

° The Visible Hand (http://www.rso.cornell.edu/ces/ 
publications.html) and 
° Opus 1 (http://www.opusl.org) 


second half descxibes how to prepare and give a research presentation. 
Since the report summarizes the process the researcher has undertaken, 
we will make numerous references back to earlier chapters. 

Writing the Research Report 

In Chapter 5, we introduced the idea that writing is a product of an 
intellectual endeavor. In this chapter, we present an in-depth example 
of that process: how the written research report becomes an outcome of 
the research process. Research xxports come in several varieties. They 
include reseaixh papers (both undergraduate and graduate), under- 


mi 



Figure T 2.1 Components of an Empirical Research Paper in Economics 

Title 

[Abstract] 

[Table of Contents] 

[Acknowledgments] 

Introduction and Literature Survey 
Theoretical Analysis 
Empirical Testing 
Conclusions 
References 

[Data and Other Appendices] 


graduate and masters’ theses, doctoi'al dissertations, scholarly journal 
ai'ticles, monographs, and books. What we present here applies to all 
types of research reports, though what comprises a section of a lxsearch 
paper or journal article likely takes up a chapter or more in a book- 
length report. Journal articles tend to be the most condensed. Theses 
and dissertations tend to be longer since the author needs to demon¬ 
strate his or her command of matei'ial that would be assumed in a jour¬ 
nal article or even a book. 

The purpose of the written report is to present tire results of your reseaixh, 
but more importantly to proride a pei'suasive argument to readers of what 
you have found. Recall from our earlier study of critical reading (Chapter 6) 
that there are two ways to drink about tire form of scholarly work. The first is 
format, and tire second is argument. Format is moix explicit, but argument 
is more important. The purpose of research is to advance knowledge in a 
field by providing a convincing argument supported by logic and empirical 
evidence. That is what your research report should be. 

Your paper should be organized according to tire format shown in Fig- 
uix 12.1, which we first introduced in Chapter 6. The optional parts of die 
report are listed in brackets. Feel fixe to use section headings, as tiiey will 
make die paper more readable. 

As you draft your paper, keep in mind diat the objective is to build a 
convincing case for your conclusions, namely, tiiat die hypothesis is con¬ 
firmed (hopefully!) or xxjected based on the logic of your analysis and 
your empirical evidence. You may find it useful to review Chapter 4 on how 
to construct arguments. 

HI 
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Introduction 

The purpose of the introduction to the research report is to provide 
the rationale foi the research. This rationale should address four issues: 
What is the nature of the issue or problem the research investigates? Wiry 
is it worthy of investigation? What have previous researchers discovered 
about this issue or problem? What does your research attempt to prove? 
In other words, what is the contribution that your research will make to 
tire literature? 

The introduction is a critical part of tire research report because it is of¬ 
ten here that readers decide whether or not to continue reading the rest 
of tire report. As tire author you need to think of a way to persuade tire 
reader to continue. 

Start the introduction by sketching out the problem that the research 
addresses. What evidence can you offer to describe tire issue? Wiry is it a 
problem? Is it a public policy issue, a social problem? Is it purely an intel¬ 
lectual puzzle? Who would be interested in tire problem? It is often help¬ 
ful to cite appropriate statistics to illustrate tire magnitude of the problem 
or, better yet, use a figure or chart. The following paragraphs, which begin 
the introduction of a typical research paper, provide an admirable exam¬ 
ple of how to frame tire problem: 

Researchers are unsure of when and where the human immuno¬ 
deficiency virus that causes the Acquired immune Deficiency Syn¬ 
drome originated. What is certain, though, is that HIV/AIDS has 
become a pandemic disease since its widespread recognition as a 
major global health crisis in the late 1980s and early 1990s. 

This disease represents a severe development crisis in sub- 
Saharan Africa, the most afflicted region in the world. More than 

29.4 million people are living with HIV/AIDS in sub-Saharan Africa, 
including 3.5 million new infections in 2002. Sadly, an estimated 

2.4 million Africans died from complications with HIV/AIDS in the 
past year. National adult HIV prevalence has reached astronomical 
proportions, exceeding 30% of the adult population in four southern 
African countries: Botswana, Zimbabwe, Swaziland, and Lesotho. 
Even South Africa, widely considered the most developed country in 
Africa, has been touched by the far-reaching socioeconomic conse¬ 
quences of HIV/AIDS. It has become the most afflicted country in the 
world, with 4.9 million people living with HIV/AIDS. 

Though it is not yet clear where this introduction is going, tire reader 
should have no doubt that the HIV/AIDS epidemic in southern Africa is 
a substantial social problem. 
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; NOTES FOR NOVICE RESEARCHERS 
Using Figures Effectively 

The purpose of charts, tables, and other graphics is to summarize 
and illustrate the argument in the text. Every figure should be 
designed to be easily understood independent of the text. Each figure 
should have a clear explanatory title. Each title should include two 
parts: a reference number (e.g., “Figure 12.1”) and a description of the 
figure (e.g., "Components of an Empirical Research Paper in 
Economics"). The description should be as precise as possible, rather 
than vague. If the figure shows a graph, axes should be clearly labeled, 
with the units clearly identified. If more than one concept is being 
graphed, make sure the reader can clearly tell which is which (e.g., by 
using diamond symbols, dots, and/or x’s for each element). If the figure 
shows a table, the numerical values in the columns should be aligned 
with the decimal points. In addition, each numerical value should have 
the same number of digits to the right of the decimal point. 

Figures are best located in the text rather than as appendices. An 
exception is if you are running numerous regressions. In this case, you 
should summarize your results with a table in the text, but put the 
complete regression results in an appendix. 

Make sure you refer the reader to each figure in the text of the report. 
Otherwise, they may not read the figure, which means it is not serving its 
purpose. This is especially hue for figures that are relegated to appendices. 




Next, explain to tire reader why the topic is interesting and significant. 
We discussed what makes a research question interesting and significant 
in Chapter 2. Is it, for example, a hot topic among experts in the field? 
A research question is interesting if it addresses a problem that hasn’t 
been entirely sorted out by the experts; in other words, some aspects of 
the problem have yet to be understood. A research question is significant 
if it deals with a substantial problem, one that has major impacts on the 
field. 

The next step in tire introduction is to summarize what has been done 
already to study the problem. This is tire literature survey, which will be dis¬ 
cussed in the next section of this chapter. Use that as a springboard to ex¬ 
plain what your research is designed to accomplish. 
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Write the Introduction Last! 

Many experts think that you should write the introduction to your 
paper last rather than first. This makes sense since you want to hook 
your reader with the introduction, so you'll want to know what your 
paper says before you write its beginning. Alternatively, you may draft 
the introduction first but revise it last. Note also that if you have 
written a research proposal, much of the proposal itself can form the 
basis for the introduction to your report. 


The Written Literature Review 

An important component of a research report is the literature review or 
survey. We discussed the process of reviewing the literature in Chapter 3. 
In a diesis or longer work, the written literature review may be a part or 
chapter separate from die introduction, but in research papers or journal 
articles it is usually included as part of die introduction. 

A literature review is a summary of die major studies that have been 
published on a research topic. The literature review should accomplish 
three goals. First, it should identify the major findings on a topic up to the 
present. Second, it should point out die principal deficiencies of diese 
studies or provide a sense of what is lacking in the literature. Last, it should 
conclude by leading into your research question, by explaining how your 
research proposes to contribute to die literature or address some short¬ 
coming of a previous study. Thus, die primary purpose of die literature 
survey in a written report is to provide a justification or rationale for your 
research. 

Writing the Literature Review 

Norice researchers tend to write excessively long literature surveys. This 
is understandable since drey are likely to have a lot of notes on die studies 
diey have read but perhaps fewer notes on and less confidence in their 
own original work. After all, summary is easier than analysis. It is impor¬ 
tant, however, not to allow the literature survey to “ dominate your paper" as 
Boodi et al. (1995,163) put it. For most research papers, the literature sur¬ 
vey should be biief. 

Students sometimes ask how many sources should be included in die lit¬ 
erature survey. The answer depends on how many major studies have been 

236 
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; NOTES FOR NOVICE RESEARCHERS 

What a Literature Survey Is and Is Wot 

Novice researchers sometimes seem confused about what exactly 
should be included in a literature review. The literature survey is not 
° a list of potential sources of information about your topic, or 

0 a list of sources that you reviewed, or even 
• a list of summaries of the sources you reviewed. 

Rather, the literature survey should be a summary of the major 
research done on the topic you are working with, where the objective 
is to give the reader only the background that will be necessary for 
them to understand your research. In other words, you shouldn’t 
summarize the studies you discuss in their entirety; rather, you should 
only highlight the parts that shed light on your research question. Nor 
should you discuss studies you read but decided weren't directly 
related to your research. Note also that the literature survey should not 
include popular publications, only scholarly ones. 



completed on die topic. If you only report one or two sources, readers may 
suspect that you haven’t put enough effort into searching the literature. 
You don’t want to miss a major study, since at best it will make you look 
careless and at worst it may weaken die rationale for your research. 

If you have cast your net wide enough to find all the significant studies 
pertaining to your research question, you will undoubtedly read studies 
that you discover aren’t really germane to your topic. These should be 
omitted from both the literature survey and the reference list at the end 
of die paper. Indeed, if you haven’t found irrelevant studies you probably 
haven’t surveyed widely enough. If you have doubts about whether or not 
to include a source, see if you can write a brief one-line description of its 
relevance to your study. If you have trouble doing this, you should proba¬ 
bly leave it out. Never include in your report discussion of any source that 
provided only background information. 

For each study drat you include in your survey, you should give com¬ 
plete citation information. You should use the appropriate style for cita¬ 
tions in the text of your survey (e.g., “Anderson (2003”) and in the 
references. (If you need to refresh your memory on citation styles, review 
Appendix 3Ain Chapter 3.) You should also indicate the question that the 

in 
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I II; author examined, die audior’s findings, and anything about die methodol- 

,;| °§y 01 s h e us ed that might be appropriate to your study—-for example, 

data sources or die type of empirical testing employed. Remember to focus 
jj youi commentary on those aspects tiiat shed light 011 your research question: 

III Armstrong (1991) was able to accurately predict the incredible so- 

; cioeconomic consequences of the HIV/AIDS epidemic, theorizing 

j: that H,V/ AIDS would alter the quality and size of the labor force and 

||j impede human capital formation. As soon as adequate data became 

:j available, formal data analyses were conducted. Over (1992) found 

the net effect of the AIDS epidemic on the growth of Gross Domes¬ 
tic Product (GDP) per capita to be a reduction of a third of a percent¬ 
age point in the ten most afflicted sub-Saharan African countries. 
Bloom and Mahal (1997) concluded the opposite, reporting that the 
! impact of AIDS was statistically insignificant, by using a data analy- 

| s ' s examining the effects of AIDS on GDP growth per capita in 51 in- 

j| dustrial and developing countries. Still, recent reports such as the 

| World Health Organization (WHO) report by the Commission on 

| Macroeconomics and Health predict a decline in economic growth 

due to the HIV/AIDS epidemic. 

'I 

j: Some studies can be summarized with a sentence. Others, for example, 

a paper whose model you are going to base your own research on, will 
need more detail. You want to be as concise as possible, while getting the 
I important points across. 

j Booth et al. (1995) suggest that if a source is an important one for your 

reseaich, you should name it directly in your summarizing sentence: 
ij Ruhm (2002) has found that the adverse effect of parental work 

|| hours on children's cognitive development is statistically significant. 

| If the source is less important, however, cite it parenthetically at the end 

; of tire sentence: 

Previous studies have found a mix of results, with little or no sta¬ 
tistical significance. (Blau, 1999; Barling & MacEwen, 1988.) 

You should conclude the literature survey by leading the reader back to 
youi reseaich pioject, explaining how your research will contribute to the 
literature: 

Since few studies on the subject exist, research on the impact of 
HIV/AIDS on productivity and the labor supply must continue to be 
conducted using accurate and more recent data on the prevalence of 
HIV/AIDS. This study improves upon the existing research by using 
actual AIDS data collected by each country, rather than predictions, 
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in attempting to discover the extent of the impact of AIDS on the la¬ 
bor supply in Botswana, Zimbabwe, Swaziland, and Lesotho; the four 
most afflicted countries in sub-Saharan Africa. 

Theoretical Analysis 

The purpose of this part of the research report is to present the theo¬ 
retical analysis of the issue or problem you are investigating. This is also 
described as presenting your theoretical model. We discussed how to de¬ 
velop this analysis in Chapter 7. Recall that the objective of research is to 
advance knowledge in a field by providing a convincing argument that is 
supported by logic and empirical evidence. This section of the research re¬ 
port presents the logical evidence as a deductive argument; the empirical 
evidence is presented in tire next section of the report. 

So, how does one explain the theoretical analysis? There axe several 
ways to do this, so let’s explain them from simplest to most complex, using 
examples from student papers. The basic approach is straightforward: you 
must clearly describe the the 017 you are applying to your research prob¬ 
lem, explain in detail why it is relevant, and then sketch out how it diag¬ 
noses the problem. This diagnosis is tire research hypothesis. 

If you are applying a common economic theory, such as the theory of 
demand, you should explain how your problem is an application of that 
theory and what the major elements of your analysis are. What plays the 
role of price, income, tastes, prices of related goods, and the like in your 
analysis? Note that since you are applying a common theory you should as¬ 
sume that your readers will be familiar with it. There is no need to derive 
tire law of demand or any other results that all students of economics 
know. Indeed, doing so will detract from your report . 1 

In order to demonstrate that consumers prefer financing incen¬ 
tives, the theory of demand will be applied to the sale of new auto¬ 
mobiles in the United States. According to the theory of demand, the 
demand for a certain good is a function of changes in the price of that 
good, the disposable income of consumers, and consumers' prefer¬ 
ences. ... Because a change in the interest rate on a loan used to pay 
for a good is effectively equal to a change in the price of a good, the 
two variables should have [a comparable] impact on the sale of new 
automobiles. ... I hypothesize that the interest rate elasticity of de¬ 
mand for automobiles will be larger than the price elasticity of de¬ 
mand. . . . [I]n accordance with the theory of demand, I predict that 
the price and rate elasticit[ies] . . . will be negative and that the in¬ 
come and measure of consumer sentiment elasticities] . . . will be 
positive. (Beck, 2003) 
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If you are modifying an existing applied theory, you should first restate 
that dieory in enough detail so readers can fully understand what the ear¬ 
lier researcher did. You should not assume drat they will be familiar with 
the previous dreory. Next, you need to explain how your research is an ap¬ 
plication or extension of that applied theory. Often, you need only indi¬ 
cate drat tire earlier theory was from the literature on your problem and 
that you have made a potential improvement. 

The audror of the following quotation modifies a model by Ehrenberg 
et al. ( 2001 ), which she indicates provides “a solid framework for the stud¬ 
ies of tire executive compensation in higher education.” After explaining 
the model in some detail, and observing that its empirical results were 
weak, she brings in additional factors that she hopes trill improve the 
model: 

I use an empirical model similar to Ehrenberg et al. (2001) to ana¬ 
lyze the factors influencing the salaries of liberal arts college presi¬ 
dents. In addition to college performance variables and human 
capital variables [used by Ehrenberg et al.], I include social capital 
variables that trustees may value in a president—leadership skills, so¬ 
cial capital, academic and managerial reputation. (Sorokina, 2003) 

As you sketch out the logic of your theoretical analysis it is helpful to re¬ 
member that you are trying to develop a deductive argument that culmi¬ 
nates in the research hypothesis, which is often expressed in the form of 
an equation of the model. Recall that deductive reasoning starts from one 
or more assumptions and derives a specific prediction from them. 

If your analysis includes formal mathematical reasoning, either an op¬ 
timizing approach or ad hoc, explain it here. What are the assumptions 
from which you start your analysis? What mathematical manipulations did 
you perform on the assumptions in order to obtain your research hypoth¬ 
esis as tiie result? 

In an empirical research paper, emphasis tends to be less on tire theo¬ 
retical analysis than on tire empirical analysis. This is as it should be. At the 
same time, you still need to do a complete job of explaining your theoret¬ 
ical analysis. Even if you are merely applying a common economic the 017 
or modifying, an existing applied theory, the theoretical analysis is more 
important than you may think. This is because the validity of empirical test¬ 
ing always depends on the underlying theoretical model. Empirical testing 
without theory only shows correlation rather than the causation that re¬ 
searchers seek. The causal inferences must come from the theoretical 
analysis! 
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The Importance of Explaining the Theoretical Analysis 

The purpose of empirical testing is to validate your hypothesis. 

Most undergraduates can successfully apply statistical software to a 
data set to perform a statistical analysis. The underlying question for 
any statistical analysis, however, is what does it mean? The results of 
the statistical analysis have no economic meaning unless they are 
interpreted in the context of an economic theory. The theory always 
has to come first. The theory explains where the hypothesis came 
from. The hypothesis is a proposition about cause and effect. If the 
statistical results are consistent with the hypothesis, then and only 
then do they support a causal interpretation. 



Empirical Testing of the Analysis 

The purpose of the empirical testing part of the research report is to 
provide the empirical evidence for your research argument. The theme of 
this section of the paper can be summarized as: Given your hypothesis, 
how did you test it and what were your findings? 

Your discussion should include a number of elements, including the 
data used, the empirical model and type of statistical analysis you em¬ 
ployed, tire results you hypothesized, your actual results, and your inter¬ 
pretation of tire results. We discussed developing an empirical testing 
methodology in Chapters 10 and 11 and finding data and constructing a 
data set in Chapters 8 and 9. If you don’t feel comfortable with this mate¬ 
rial yet, you might want to revisit those chapters. 

You’re not merely explaining what you did, but rather trying to per¬ 
suade the reader that your approach was a fair and accurate test of the hy¬ 
pothesis. For that reason, you need to think about, if not explicitly address, 
questions such as: Does your empirical model adequately represent your 
theoretical one? Are the data good enough to accurately test your hy¬ 
pothesis? Does your statistical methodology discriminate well enough to 
get at the question you’re asking and rule out alternatives? Does it ade¬ 
quately control for outside factors? Ultimately, you need to ask: if the em¬ 
pirical test obtains the best possible results, how confident can you be that 
tire hypothesis is correct? Let’s discuss each of these issues. 
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Testing Methodology Early in this part of the research report, it is help¬ 
ful to have a clear statement of the testing methodology, including the 
type of statistical analysis employed and a brief description of the scope 
of the test: 

This research uses a pooled Ordinary Least Squares regression 
analysis from 1990 to1998 for Botswana, Zimbabwe, Swaziland, and 
Lesotho. 

In order to test the hypothesis, this research performed an ordi¬ 
nary least squares regression using monthly data over the time pe¬ 
riod between 1996:1 and 2001:12. 

You may wish to explain why the statistical methodology is an appropri¬ 
ate method for testing the hypothesis. Ordinary least squares may be con¬ 
sidered the default statistical technique, though it may occasionally be 
adequate to use something less sophisticated like a statistical test of means. 
If you use any technique other than ordinary least squares, such as pooled 
OLS, generalized least squares, probit, and the like, you should explain 
why that is appropriate. 

Empirical Model This section of the report should include an equation 
for the empirical model. The empirical model is the actual version of your 
theoretical model that you test empirically. (Note that if you use a statisti¬ 
cal test that is less sophisticated than regression, there may be no empiri¬ 
cal model per se.) As such, there should be a clear correspondence 
between the variables identified in the theoretical model and those em¬ 
ployed in the empirical model. It is common that data for one or more 
variables in the theoretical model are not obtainable. Indeed, this is per¬ 
haps the principal difference between the theoretical model and the em¬ 
pirical model. There are two solutions to this difficulty. The first is to find 
data to serve as a proxy for the theoretical variable. The second is to omit 
the theoretical variable. Whatever approach you take should be explained. 
You should also explain how you obtained the empirical model from the 
theoretical model, in terms of functional form, mathematical manipula¬ 
tion, and so on. 

The empirical model is this: 

Q d = 3q + bf Pd + + bg Yp + b 4 A + e 

where P d is the average diamond price, M is the number of marriages, 

Y d is the disposable income of the consumer, A is the event of ad¬ 
vertising, and e is the error term. 

Data Explain the sources and methods of the data used to test the hy¬ 
pothesis. 2 What is the data set, and where did you obtain it? What data 
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transformations, if any, did you perform? If this discussion is complex, 
briefly summarize it in this section, and then refer tire reader to a more ex¬ 
tension discussion in the data appendix. This section of the paper should 
be explained in enough detail to persuade the reader that the data are ad¬ 
equate to fairly test the hypothesis. 

I compiled data from the Trade DataWeb at the U.S. International 
Trade Commission website on the quantity of diamonds imported, 
the average price of a diamond for each month, personal disposable 
income, and the number of marriages occurring each month. I also 
constructed a dummy variable to indicate when new advertising 
campaigns began. I found seventy-two data points for each variable, 
one for each month over a period of six years. This data is located in 
the appendix at the end of this paper. 

Hypothesized Results Before you present your actual results, it is critical 
for the validity of your test to identify the results that are predicted by and 
thus would support your hypothesis. For example, you should state the al¬ 
gebraic sign and magnitude (if this is predicted by the theory) for each 
variable in a regression equation: 

I expect certain relationships between these economic factors. I 
predict that as advertising (my measure of tastes and preferences) in¬ 
creases, demand for diamonds will also increase. Likewise, as con¬ 
sumers' personal disposable income increases, so will the demand 
fot diamonds. There should also be a positive correlation between 
marriages and number of diamonds demanded. Finally, I predict that 
as the price of diamonds increases, the quantity of diamonds de¬ 
manded will decline, although I do not expect this correspondence to 
be as strong as the other relationships. 

Actual Results Display your empirical results in a table or figure that is 
clearly visible in the text. (Only if there are many regressions should you 
put the results in the appendix; even then, summarize tire key results in a 
table in the text.) A sample is shown in Figure 12.2. The table should in¬ 
clude the sample size, estimated coefficients for each explanatory variable, 
some measure of statistical significance (e.g., Rvalue, t statistic, or standard 
error) for each variable, and goodness of fit information (e.g., Adjusted R- 
squared). It should also include any relevant test statistics for possible 
econometric problems (e.g., Durbin-Watson statistic). Like all figures, this 
one should be clearly labeled and titled. Finally, you should refer readers 
to the table, not simply assume or hope they will find it. 

Interpretation of the Results Once you have presented the empirical re¬ 
sults, you need to interpret them for the reader. We noted in Chapter 2 
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Figure 12.2 Regression Results 

Dependent Variable: Q 
Method: Ordinary Least Squares 
Sample: 1996:01 2001:12 

Included observations: 72 


Variable 

Coefficient 

t Statistic 

P Value 

C 

-2881179.000 

-5.976 

0.000 

P 

-2231.395 

-7.726 

0.000 

M 

1.469 

2.311 

0.024 

Dl 

585.034 

6.947 

0.000 

ADS 

-25958.720 

-0.263 

0.793 


R-Squared 

0.692 

Durbin-Watson 

Adjusted R-Squared 

0.668 

F-Statistic 


that this is the part of the research process that novice researchers seem to 
most overlook, and yet it is here that your research hypothesis stands or fails. 

How do the estimated coefficients compare with the predictions of the hy¬ 
pothesis? Were the estimated coefficients statistically significant? Were they 
economically significant? How do the results compare with diose of previous 
studies? What can you conclude about the results from the goodness-of-fit sta¬ 
tistics? Were there any statistical problems that need to be corrected in order 
to obtain valid results? If so, make clear that these are the corrected results. 

It is unlikely that you will obtain perfect results—for example, where 
every estimated coefficient has the correct sign and is economically and 
statistically significant. Discuss any anomalous results and suggest expla¬ 
nations, where possible. 

The results of my regression were fairly strong. My R-squared 
value indicates that I explained approximately 69% of the data with 
my model. All of my variables were significant except for advertising. 
Price was highly statistically significant which I did not expect. I was 
under the impression that De Beers kept prices fairly constant, but 
this is apparently not the case. The price coefficient was negative, 
which agrees with the theory of demand, in that price has a negative 
effect on the quantity demanded. Marriages were positively corre¬ 
lated with diamond demand, which also agreed with my hypothesis. 
This variable was not as statistically strong as some of my other fac¬ 
tors. I believe that this is due to the effect of substitutes; diamonds 
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Never cut and paste regression results from your statistical software 
into your paper. Rather, choose the results that you wish to present. It's 
good practice to identify the dependent variable in a regression as well 
as the sample size. Omit anything from the regression results that you 
don’t understand, since you could be asked to explain it. (Hopefully, 
this doesn’t include any of the important results described earlier.) 
Finally, don’t show more than a few significant digits for most entries 
in the table. 


are not the only stone presented in engagement rings, and not every 
bride receives an engagement ring. Disposable income was also a 
very strong variable. It is positively related to the demand for dia¬ 
monds, which indicates that as consumers' disposable incomes in¬ 
crease, the quantity of diamonds demanded will increase as well. On 
balance, my regression results seem to support my hypothesis. The 
majority of my variables were significant, and the coefficients had the 
signs that I expected. The advertising variable was not very satisfac¬ 
tory. I was expecting it to be positively, correlated with diamond de¬ 
mand, and highly significant. Unfortunately, the regression showed 
that it was in fact negatively correlated with diamond demand and 
not very significant at all. I believe that this may be the result of poor 
data. I could not find a definitive source for advertising data on the 
Internet, so I used the few press releases that I could find to construct 
the dummy variable. If I could run the regression again, I would like 
to find empirical data for advertising. Empirical data would result in a 
more scientific evaluation of the advertising variable. I would also like 
to find data on substitutes for diamonds, such as prices and quanti¬ 
ties of sapphires. I believe that these additions will allow my model 
to explain a large proportion of the data, improving its accuracy. 

Conclusions 

The purpose of this part of the research report is to summarize your 
findings, that is, to restate your argument and conclude whether or not 
it is valid. You should note that conclusions are inductive: in light of the 
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NOTES FOR NOVICE RESEARCHERS 


A Failed Hypothesis Does Wot Mean a Failed Project 

Note that it is perfectly appropriate to conclude that, based on the 
empirical testing, the hypothesis fails. This does not mean your 
research is a failure or that you will obtain a poor grade. Nor should 
you reverse your hypothesis so that it is confirmed by the testing—this 
is bad science. 


statistical results, what can you infer about your hypothesis? To what ex¬ 
tent did your empirical testing confirm your analysis? 

If the results confirm the hypothesis, before you congratulate yourself 
recall that those results may also be consistent with alternative hypotheses! 
In other words, a weak empirical testing design may not rule out plausible 
alternatives. Based on your empirical tests and results can you rule any al¬ 
ternatives out? If not, you should admit this in your concluding remarks. 
By admitting to the weaknesses of your argument, you actually make it 
stronger. 

If the testing did not confirm the hypothesis, can you suggest reasons 
why it didn’t? Were the data or testing methodology flawed? What do you 
know now that you wish you had known when you began the project? What 
could be done to improve the results, given additional time and effort? 

Finally, what can be concluded about the research question more 
broadly? 3 What have you learned from tire project that might help other 
researchers in the field? For example, the literature on tire relation be¬ 
tween parental employment hours and children’s cognitive development 
lacks consensus about whether the effects are permanent or temporary. 
This suggests that something fundamental is missing from the studies to 
date. A recent study concluded by proposing that what is missing in past 
studies is controlling for the quality of child-care, data for which has not 
been available in tire past. Thus, rather than continuing to test with exist¬ 
ing data, a more promising approach could be to develop data for child¬ 
care quality. 

Other Components of the Paper 

Research reports include several other components, some of which are 
optional. These include a title, an abstract, a table of contents, acknowl¬ 
edgments, references, and data and other appendices. 



Title All research reports should have a carefully composed title. A good 
title, like a good introduction, is designed to capture the reader’s interest. 
One way to capture interest is to pose your research question in the title, 
especially if it raises an angle that hasn’t been thought of before. An ex¬ 
ample might be Is Divorce Indebting Our Children ? Another way is to high¬ 
light counterintuitive findings of your research. 

A good title will have an enigmatic quality; it will identify tire research topic 
but leave enough unsaid to draw die reader in. One way to do this is by de¬ 
veloping a two-part title, where die first part raises die general topic, and die 
second pail poses a more specific question. Examples of this include: 

Intergenerational Mobility: Do First-generation Immigrants Lag Behind? 

All Men Created Unequal: Trends and Factors of Income Inequality in the 
United States 

Wage Discrimination in the NBA, Does It Exist for Foreign-Bom 
Players1 

Because online searches often used titles as targets, it is good practice 
to specify the key concepts of die research in your tide. You should be par¬ 
ticularly careful about avoiding vague tides that don’t make clear what die 
paper is about, such as 

Aggregate Reflections of Brand Loyalty or Unraveling the Size Effect 

If the reader can’t figure out what the paper is about, he or she is un¬ 
likely to decide to read it, however good the research may be. 

It is common to begin a research report with a tide page. In addition to 
the report’s tide, the tide page should include the author’s name and af¬ 
filiation and die date of the report. For a term paper, the affiliation would 
list the course the research is for and possibly die instructor’s name. For a 
thesis or journal article, it would indicate die author’s academic depart¬ 
ment and institution. 

Abstract It is good practice to write an abstract of your research report. 
Refer to Chapter 6 for how to do this. Remember that in this age of elec¬ 
tronic databases die abstract is likely to be searched for keywords. The ab¬ 
stract should be placed eitiier on the tide page or with the first page of text. 

Table of Contents Any book-length research report or diesis will benefit 
from a table of contents. Make diis a separate page. A table of contents is 
not usually included in ajoumal article. 

Acknowledgments It is considered good form to acknowledge any sub¬ 
stantial assistance you received in a research project. Acknowledgments in 
an article are usually positioned as an initial footnote under the title. For 
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arate page -pom, acknowledgments are p„ t on a sep- 

Reference List All scholarly works should Include a reference list or hib- 
roSSTchohrh'f °’ e 1 " dUS ! 0n of references is one feature tlrat distin- 
^ Jde comnlete t°H' ^i P" blications - The "*rence list should 
dte hr vom nan- f >ph ’ C Informati °" &r U>ose sources that you 

sty,e ' Refer bJ ■» 

styles. In general, it 

IuT f I ' matl0n ” ° r any S0UrCeS that y° u did not mention in the text 
Such refei ences are usually an indication of padding. 

ne!r nd A 0ther , APPendiCeS Research re P° rts occasionally include ap- 
pendtces. Appendices contain details that are not essential to grasp the re- 

i N0TES FOR novice researchers 

I Style Suggestions _ ~ _ ~ ~ " 

r^ a l!°!° WS 916 SUggestions yarding the style of a research 
report. Style is a matter of taste and common practice, so feel free to 
deviate from these if you have a good reason to 

It is common practice to put the abstract on the title page of the 
paper, after the title, author, and other information. The abstract should 
be single-spaced. Some authors use narrower margins for the abstract 

Thi° f t t wMe stffl «■«■*> *. 

on the page That s fine, but make sure you align the text of the 
abstract with the left margin or both left and right margins. 

Don t center-aiign the text in an abstract. 

It will make 

each line have a different length, which looks funny and 
unprofessional. 

The body of the paper should be double-spaced. Footnotes and 
references should be single-spaced. Include page numbers in the 
document for all pages after the title page. Always refer in the text to 
any figures that you provided to support your argument; you shouldn't 
' sume that readers will find and make sense of them on their own 
Before you begin drafting your paper, you may wish to review 
Chapter 5 on economic writing. Start writing the report early enough 

/dk 
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that you will have sufficient time for revisions. You don't need to wait 
until your results are finalized to begin writing. It’s not uncommon for 
researchers to draft the report, except for the results and conclusions, 
before completing their empirical work. You want to craft your argument 
to maximize its impact on your readers. For example, page-long 
paragraphs suggest insufficiently deep thinking about the story you are 
telling. Remember, a paragraph is supposed to explain a single thought. 

Don't begin your literature survey by indicating that no previous 
research has been done on your exact topic. That is actually a quite 
normal occurrence. Rather, think of a way to lead into the literature 
that you have found useful in formulating your own research. Refer to 
the authors of a study as people; don’t treat the study as if it were 
making the argument. Don't say "Francos and Schairer (2003) notes"; 
say "Francos and Schairer (2003) note." 

Don’t discuss the data or other empirical issues in the theoretical 
section of your paper. Remember, the theory section is supposed to 
develop your hypothesis, not test it. 

Always state the empirical results predicted by the theory before 
you present the empirical results. If you performed multiple iterations 
of the empirical test, minimize discussion of the process that you went 
through to obtain the results. Don’t say: I did this; then I did this; then 
I corrected for that. Rather, summarize the problems you ran into and 
what you did to address them; then present and discuss the final 
results. Don’t begin presenting your results'by discussing R-squared, 
since this is not the most important aspect of testing hypotheses. 
Estimated coefficients, which are not statistically significant, are 
considered zero, so don’t overemphasize their sign or magnitude. 
Similarly, don’t discuss the estimated coefficients independently of 
their statistical significance. If you present your regression results in 
the form of an equation, be sure that under each estimated coefficient 
you include the standard error, t score or p value so the reader can 
assess the statistical significance of each estimated coefficient. 

Don’t change your hypothesis after obtaining your results. It doesn’t 
make the paper stronger, and it violates the scientific method. Rather, 
stick with your original hypothesis, but report that it fails. The paper 
will be stronger, and you will get a better grade. (Failure of the 
hypothesis does not cause a lower grade! A low grade is given for a 
poorly argued paper.) 
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seaxxher’s argument, but that some readers may wish to have easy access 
to. A data appendix includes all the data you used in die empirical testing 
of dre research project, along with complete information about your 
sources and methods. Refer to Chapter 9 for more detailed discussion of 
how to construct a data appendix. Otirer appendices might include survey 
questions used to develop data or empirical results from multiple tests drat 
would distract readers from your argument if their details were included 
in the text of the report. 

Presenting Research Orally 

The second product of a research project is an oral research presenta¬ 
tion. There are different types of research presentations, depending on the 
audience and lengdr of the presentation. Some are directed toward a gen¬ 
eral audience, either an audience of economists who are not experts on 
die topic or an audience of noneconomists. General presentations tend to 
last relatively longer, say, thirty to sixty minutes, to enable the presenter to 
develop enough context to make his or her remarks comprehensible. 

At the odier end of the spectrum are undergraduate honors, master’s, 
or Ph.D. thesis presentations. These presentations are addressed to faculty 
who are assumed to know the discipline and die topic of the research. In 
general presentations, the speaker knows far more than the audience and 
needs to make his or her remarks general enough to be understood. In 
contrast, in thesis presentations the speaker is expected to demonstrate 
mastery of research techniques and the nuances of the topic. In a sense, it 
is as if the audience knows more about die subject tiian the speaker. The¬ 
sis presentations tend to last an hour or longer when questions and an¬ 
swers are included. 

The majority of research presentations are those tiiat scholars routinely 
engage in as part of their research. These presentations are between ten 
and twenty minutes long. They will be the focus of die remainder of die 
remarks in this chapter. 

Though we tend to diink of research presentations as providing re¬ 
searchers with an opportunity to disseminate research findings, more of¬ 
ten their primary purpose is enable die researcher to obtain feedback 
from other scholars before submitting his or her work for publication. 
Journeyman researchers tend to fear criticism of their work, perhaps see¬ 
ing it as criticism of themselves. By contrast, professional researchers look 
forward to criticism of their work because it helps diem improve it. 
Though you might think an absence of criticism from other scholars 
means your research is flawless, it is much more likely to mean that no one 
has carefully considered your work! This is especially true of an initial pre- 
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You Are the Expert on Your Topic 

Despite what you may think, rarely will anyone know as much 
about your research as you do. The only possible exception is the 
faculty member who is supervising your research, and you can be sure 
that he or she will be on your side in a thesis presentation. 


sentation. Like writing a draft report, essentially no one can present a re¬ 
search project perfecdy on die first try. In this light, Missimer (1995) ob¬ 
serves that, “If we can acquire die habit of seeing [research presentations] 
as opportunities to acquire wisdom, with no possibility of ‘losing’ an argu¬ 
ment since we always ‘win’ insight, it will be easy to cultivate a spirit of wel¬ 
come for any ideas” (p. 43). 

An Oral Presentation Is Different from a Written One 

Though both a research presentation and a research paper report the 
results of research, diey are structured very differendy. To begin with, you 
won’t be able to cover as much material orally as you can in writing. At 
least, you won’t be able to cover it well. 

This is time for several reasons. First, as we noted earlier, you will have 
only a limited amount of time to present,' often only about ten minutes. 
Second, it is substantially harder for an audience to assimilate an oral pre¬ 
sentation tiran a written one. This is pardy because of humans’ ability to 
process aural information compared to visual information. Additionally, 
in written discourse the reader can always go back and reread. In an oral 
presentation, they cannot. This leads to the tiiird reason. 

An oral presentation should be given at a lower technical level tiian a 
written presentation. 4 A research paper is implicitiy addressed to an audi¬ 
ence of experts in die field. Indeed, reviewers of papers submitted for pub¬ 
lication are always chosen from such experts. By contrast, a research 
presentation should be addressed to a more general audience. Even if the 
audience includes economists who have backgrounds in die general area 
of your interest, few trill be expert in your specific topic area. There is also 
a good chance diat at least some in the audience are merely interested in 
die topic but have littie or no previous expertise in it. 

In short, a research presentation should summarize your research paper. 
It shouldn’t attempt to replicate it in oral form. You should think of giving 
an oral presentation as telling a story; in odier words, it should focus more 
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on the research problem or question and less on the technical details. Of 
course, some technical points will be necessary in the presentation, for ex¬ 
ample, die equations of die model and die empirical results. But any tech¬ 
nical points or results should be illustrated widi handouts or graphics to 
make diem easier to understand. We will discuss each of tiiese points in 
more detail in die following sections. 

Preparing the Presentation 

You may not realize it but anxiety over public speaking is very common, 
especially among tiiose who lack experience witii oral presentations. The 
key to minimizing such anxiety is to know your material. This involves two 
steps: planning what you will say and practicing the presentation until you 
know it well. 

Let s explore how to plan an effective presentation, beginning with 
some general comments. First, your presentation should highlight what is 
novel about your research, whether it is a new or better model applied to 
the topic, a new data set, or a new statistical technique. Speech experts 
teach that an audience will only be able to take in and retain a limited 
number of ideas. As a result you must ask: what are the main points I want 
die audience to remember? This list should be relatively short. Remember, 
less is more. 

Preparing your presentation “notes” is an intensely personal and sub¬ 
jective tiling. It is often useful to make an outiine of the major points, as 
well as include definitions of key terms or quotations, equations of die 
model, and empirical results. There is no single correct approach. Some 
speakers use detailed notes; others use more limited ones. 

There is only one rule for presentations: Never read your remarks! Sim¬ 
ilarly, you probably shouldn’t memorize your research presentation either. 
When reading or presenting a memorized report, speakers act as if the au¬ 
dience is not there. By contrast, what you want to do is engage the audi¬ 
ence in a conversation. After all, you want tiieir feedback. To do tiiis, you 
want to “talk about” your research. You need to know die material, but you 
don’t need to know each word. 

The format for a research presentation is fairly well defined and similar 
to diat of a research paper. A research presentation should be structured 
to have four parts: die inU'oduction, die analysis, the results or evidence, 
and die conclusions. 

When you prepare your presentation, you should think explicidy and 
carefully about die introduction. You begin by explaining what your topic is. 

It is often useful to demonstrate die issue or problem with a figure. McCuen 
et al. (1993) argue that die first minute or two of a presentation is the most 
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critical part because it is dien diat audiences decide consciously or uncon¬ 
sciously whedier tiiey will pay attention to die rest of die presentation. 

So how do you engage die audience? By showing that die problem is in¬ 
teresting and wordi exploring. What has been done in die past to investi¬ 
gate die topic? Why has past research not been entirely satisfactory? In 
other words, why is die topic still a problem for researchers? This is die 
part of the presentation that is most like storytelling. You should plan on 
spending a substantial proportion of your presentation on the inU'oduc¬ 
tion, say, perhaps a tiiird. 

The next two sections make up the body of die presentation, which 
should compose roughly half of die allotted time. You should plan to 
spend relatively more time on the section diat discusses die principal con¬ 
tribution of your research, and relatively less on the otiier. The analysis sec¬ 
tion summarizes how your research, addresses the deficiencies in previous 
work. Here you should describe the approach you took to analyze die re¬ 
search problem, presenting your model or methodology. The empirical 
section describes how you tested your analysis, die results you obtained, 
and the extent to which they confirmed your analysis. 

You should also think explicidy about how you will conclude the pre¬ 
sentation. The conclusion is almost as important as die inu-oduction. It is 
here diat you tell your audience what you want them to remember from 
die presentation. You can signal fins by starting your conclusion with a 
phrase like: “Let me summarize . . .” or “In conclusion. . . .” Such a state¬ 
ment acts like a road sign for die exit on a highway to your destination. It 
signals the audience that the presentation is . almost over, and it naturally 
reengages tiieir attention. After giving tiiis signal you should state your 
conclusions clearly and concisely. In total, tiiis section should take about 
10 percent of die presentation’s time. 

Use of Visual Aids: Handouts, Transparencies, PowerPoint 

We pointed out earlier that die main weakness of die spoken word as a 
means of conveying information is diat visual comprehension is faster and 
more complete than aural comprehension. One solution to this very real 
problem is to use visual aids to complement your research presentation. 

Visual aids include handouts, transparencies, or PowerPoint presenta¬ 
tions. Handouts are printed pages, eitiier disnibuted individually or as a 
package. Transparencies are slides that are displayed using an overhead 
projector. PowerPoint is computer software diat creates presentations that 
can then be displayed as handouts, transparencies, or directiy from the 
software using either a display panel, which sits on top of an overhead, or 
a separate data projector. Note that handouts and transparencies can also 
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Opening Slide of a Presentation 

The opening slide of a presentation should be like the title page of a 
research report. Put the title of your paper as well as your name and 
institutional affiliation on the opening slide. It’s also common practice 
to include the name, location, and dates of the conference or workshop 
where you are presenting. 


be created using word-processing software. Each visual aid format has 
pluses and minuses. Let’s begin by discussing the use of visual aids in gen¬ 
eral; then we will examine the different formats. 

Two types of information benefit from visual aids: information that 
you want listeners to remember and information of a technical nature. 
Think about an outline of your presentation. You might use visuals to 
show the titles of each of the major sections of the presentation or each 
of the main points in the presentation. You might use them to show def¬ 
initions of key terms or quotations. You might also use them to list the 
conclusions of your presentation. Suppose you introduce the presenta¬ 
tion with statistics or data to illustrate the research problem. Showing a 
figure or chart will make more of an impact on the audience than simply 
quoting the statistics. Remember, a picture is worth a thousand words. 
Additionally, you should always use visuals to show the equations of your 
model or to present empirical results. All tables, figures, and charts 
should be fully labeled so the audience can understand them without any 
instruction from you. However you use visual aids, the key rule is that vi¬ 
suals should be designed to complement the presentation—they should 
never become its focus. 

PowerPoint can be a useful tool for organizing a presentation, whether 
in the form of handouts, transparencies, or directly via the software. Using 
your research paper or notes, you can “wing” an oral presentation, giving 
it without adequate thought or planning. You can’t wing a PowerPoint pre¬ 
sentation, since designing the presentation forces you to plan it. 

When you design each slide, you are creating talking points. For slides 
to have their most powerful impact, they should not be crowded with in¬ 
formation. You want the listeners to grasp each point with a minimum of 
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effort, and then return their attention to you, as you elaborate in more de¬ 
tail. Slides should show less information titan you discuss, not more. The 
exception to this is statistical results where you may want to show all the re¬ 
sults, but not discuss each explicitly. Note, however, that anything you in¬ 
clude on a slide is grounds for discussion, so never cut and paste entire 
regression results. Rather, always create your own table that includes only 
die results you are willing and able to talk about. 

Let the slides serve as tire outline of your remarks rather than tire text 
of them. Don’t merely read your slides. The audience wall read the slides 
faster than you can voice them, so if your presentation consists of reading 
tire slides verbatim, you will lose your audience. The exception to this rule 
is slides that contain definitions of key terms or quotations. These it is ap¬ 
propriate to read. 

As we observed earlier, each format of visual aid has pluses and minuses. 
Handouts are bulky to transport aird take time to distribute. Transparencies 
can be cumbersome to use. The placement of tire overhead projector can 
obstruct tire view of tire audience, especially when changing transparencies. 

Whether using transparencies or PowerPoint, inexperieirced pre¬ 
senters often fail to leave slides up long enough for audience members 
to read them, frustrating those who struggle to follow. Visual aids are 
supposed to enhance the presentation, not detract from it. This is es¬ 
pecially problematic for technical material. Remember that while you 
are familiar with your statistical results, the audience is seeing them for 
the first time. It is good practice to direct attention to the screen where 
slides are displayed, but avoid speaking to the screen. Remember to 
face your audience. One way to make this appear natural is to put the 
laptop that is running the presentation on your podium, which you are 
standing behind. Then follow the presentation on the laptop rather 
than from the screen. 

Handouts don’t suffer from these problems, but they do transfer atten¬ 
tion from you to your handouts. Know that as soon as you distribute tire 
handouts the audience will begin reading them and not pay as much at¬ 
tention to what you saying. One way to alleviate this is to distribute hand¬ 
outs one figure or page at a time, right at tire moment when you w'ant to 
transfer attention to the figures. 

The power of PowerPoint tends to take over the presentation—exactly 
what you don’t want it to do. If you use PowerPoint, you should minimize 
tire “bells and whistles.” Such features are like the annoying TV commer¬ 
cials in which tire audience remembers the commercial but not the prod¬ 
uct it was advertising. 
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Stay on Message! 

Your research presentation should emphasize what is most 
important in the research. Novice researchers sometimes have a 
problem seeing the forest for the trees. They talk about all the details, 
but fail to highlight what is most important about the research. What 
is the hypothesis? To what extent did the empirical results confirm the 
hypothesis? Often the test of the hypothesis is the estimated coefficient 
of a single explanatory variable. When that is the case, be sure to 
highlight it rather than treating it like any other coefficient estimate. 

Practicing the Presentation 

Once you have prepared your presentation, it’s a good idea to practice 
it at least a few times before tire actual event. At least one of those practices 
should be with an actual audience of friends. Practice will help you become 
familial' with tire material, as well as figure out what parts of the presenta¬ 
tion work well and what parts need to be rethought. Practice will also make 
you more comfortable with the presentation. 

Finally, practice will help you determine how long tire presentation ac¬ 
tually lasts, which is important for staying within tire scheduled time limit. 
Note that going through the presentation silently takes much less time 
than rehearsing out loud, so you should do the latter to determine the ac¬ 
tual length of your presentation. 

If you are using audiovisual equipment, make sure you know how to op¬ 
erate it before the actual presentation. If at all possible, test out tire equip¬ 
ment you will actually be using beforehand. With AV equipment assume 
the worst case. Plan out how you will complete tire presentation if the 
equipment doesn’t work properly. For example, you may wish to bring 
transparencies of a PowerPoint presentation in case tire hardware or soft¬ 
ware doesn’t work. The more sophisticated tire presentation, the more 
things can go wrong. If you plan for the worst, you probably won’t need 
your contingency. 

Giving the Presentation 

If you have prepared as described in tire previous sections, then giving tire 
research presentation will not be too difficult. You may feel nervous, which is 
normal. Except in extreme cases, tire audience will not notice nervousness. 
In any case, as you get into tire presentation, tire nervousness will diminish. 

m 


Presenting Research Orally 


Think of tire presentation as a conversation about your research, rather 
than as a speech. These people are here because they are interested in 
your topic and want to hear what you have to say. You, in turn, want to find 
out what they think and hear their suggestions for improvement. 

You should dress professionally for your presentation. Bring a few 
copies of your paper, since some people may well request it. If you ran out 
of copies you can always take their names and e-mail addresses and send 
diem the paper later. 

It’s a good idea to arrive early for the presentation so you won’t be 
rushed setting things up and getting comfortable with the room. Intro¬ 
duce yourself to the other speakers and discussants. If you’re presenting 
with PowerPoint and using someone else’s equipment, see if you can load 
your presentation before the session starts. 

As you begin the presentation, stand up so you can project your voice 
better. Look around the audience and make eye contact with-them. Eye 
contact is important because it engages the audience. It is sometimes help¬ 
ful to select one person in the left of tire audience, one person in the cen¬ 
ter, and one person on the right to serve as your targets. From time to time 
in the presentation, remind yourself to speak to each of these persons as 
you pan the audience with your gaze. If you know your material, this will 
be no more difficult than riding a bicycle and chewing gum. 

Start by introducing yourself and tire title of your presentation. As you 
present your remarks, speak more slowly than you think you have to. Re¬ 
member, you are intimately familiar with this material, but to your audi¬ 
ence it is new. Vary the speed and loudness of your voice; no one likes a 
monotone speaker. 

Think about your body language. Stand in a relaxed manner, rather 
than stiffly. Don’t cross your arms. Don’t lean excessively on the podium. 
Don’t fidget with your hands or anything else—it distracts the audience. 
You don’t need to stand in one place throughout the presentation. Feel 
free to move as much as possible—this will reengage your audience’s in¬ 
terest. It will also make you more comfortable and make the occasion seem 
less formal. 

As you speak, avoid filler words like umm, aadh, and so on. If you have 
practiced your presentation enough you will know the material so you 
won’t have spaces to fill. In any case, speech experts point out that silence 
is less distracting to the audience than filler words. 

It is unprofessional to exceed the time limit. If your presentation is 
overly long, at best you trill have to speed up, possibly omitting explana¬ 
tions, which will reduce the audience’s understanding. At worst, you may 
be stopped before you conclude. 
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End the presentation with a simple “thank you,” and then sit down. Let 
tire chair choose to recognize airy questions from the audience. 

After you complete your presentation, it is likely that you will need to 
field some questions from the audience. Most will be straightforward, ask¬ 
ing for clarification of some point that wasn’t completely understood. Some 
questions may raise issues you hadn’t considered before. This is a good 
tiring because it will help improve your paper. Take time to drink before re¬ 
sponding. If you hadn’t considered tire issue before say so and thank dre 
person for bringing it up. If, after some thought, you can’t think of a good 
response, that’s fine. Simply say, you’ll have to think about it furdrer. 

The Role of the Discussant 

It is common practice at professional conferences for research presen¬ 
tations to be followed by remarks from a discussant. The function of the 
discussant is twofold: first, to offer a well-drought-out, educated reaction 
to a research paper, but second, and more importandy, to give the author 
helpful feedback to improve the paper. In this section, we discuss how to 
serve effectively as a discussant. 

The discussant reads the paper in advance and prepares comments on 
it. 5 Ideally, the discussant will be an expert in the field. If you are not, you 
may need to review the literature in order to be able to talk intelligendy 
about the paper. A good place to start is the key references cited in the pa¬ 
per, especially previous studies that are surveys of the field. 

The discussant usually has only a few minutes to comment on a paper, 
so how should you use that time? To begin, you should briefly summarize 
the paper and highlight dre contribution it attempts to make to the liter¬ 
ature. What does the paper try to accomplish? How does it go about doing 
that? What results are reported? To what extent did the paper succeed in 
its goal? Often, the discussant can provide a better summary of the paper 
than the author does, since he or she is seeing it dispassionately and isn’t 
worried about die details of the work. 

What can you suggest to improve the paper? This is where the discussant 
can really help the author. Could any parts benefit from more detailed ex¬ 
planation? Was there any part that you were unable to understand? Were 
there any problems with the data that weakened the paper’s argument? 
Were there any problems in the statistical tests employed? Were there any 
errors in interpreting die results? Are there any previous studies that the 
author might benefit from reading? 

Specific suggestions are always more helpful than general comments. 
For example, compare, “I liked this paper” witii “A better explanation of 
where the data came from would make the evidence more convincing.” 



The content of the paper is more important tiian the writing, but presen¬ 
tation matters too to the extent tiiat it detracts from die argument. 

In die final analysis, what contribution does the paper make to under¬ 
standing die issue or problem it seeks to explore? One last point: it is a pro¬ 
fessional courtesy to give a copy of your comments to die author when 
you’re done. 


SUMMARY 

O Researchers disseminate their research in the form of written reports 
and oral presentations. 

e The purpose of research reports and presentations is to present a con¬ 
vincing argument that is supported by logic and empirical evidence. 

e A research report includes a tide, an introduction and survey of the lit¬ 
erature, a theoretical analysis of the issue or problem, empirical testing 
of die hypothesis, conclusions, and references. 

® The theoretical analysis in an empirical research paper is more impor¬ 
tant than one might think because it is the theory that provides the 
causal interpretation of the empirical results. Testing without theory can 
also show correlation at best. 

• Interpreting empirical results is more important than presenting them. 

O The conclusion should make the case for die research argument: to what 

extent did the empirical testing confirm: the hypothesis? 

0 A research presentation is best thought of as a summary of a research re¬ 
port. As such, it should be given at a lower technical level mid with fewer 
details. 

s The key to minimizing presentation anxiety is to carefully plan the pre 
sentation and to practice enough times so that you know the material. 

• Never read your presentation! 

e To be most effective, you should complement your presentation with vi¬ 
sual aids. 


NOTES 

1. One caveat is thesis a vriters who have supervisors who may require diem 
to demonstrate mastery of basic concepts. 

2. For more detailed discussion of this, see the last section in Chapter 9, 
“Constructing a Data Appendix to Your Research. 
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3. Ethridge (1995) makes the distinction between findings, which are the 
results of your research, and conclusions, which go beyond your partic¬ 
ular project. 

4. As noted earlier, the exception to this is in a thesis presentation. 

5. Sometimes the author delivers tire paper so late that fire discussant 
doesn t have enough time to adequately review it. In such cases, it is per¬ 
fectly appropriate to indicate that during your comments to tire audience. 

SUGGESTIONS FOR FURTHER READING 

Booth et al - (1995 )—How to draft and revise a research paper; not targeted 

toward economics but useful nonetheless. 

Ethridge' (1995), especially Chapter 10 —How to write a research report in eco¬ 
nomics; designed for graduate students but very readable and useful for 
undergraduates too. 

McCloskey (2000 )—Classic though opinionated style guide for writing eco¬ 
nomics papers. 

McCuen et al. (1993), Chapter 7-Addressed to scientists and engineers but 
still relevant to economists. 

Thomson (2001 )-Addressed to new Ph.D.s but advanced undergraduates 
may derive some insights about scholarly writing in economics. 

Wyrick (1994), Chapter 12 How to write an undergraduate research paper 
in economics. 

EXERCISES 

1. Create an outline of the argument you will make in your research pa¬ 
per: What is tire topic? Why is it worthy of study? In what ways has the 
literature on tire topic failed to completely solve the research problem 
oi answer the research question? How does your research attempt to 
correct this deficiency? What theory does your research apply? What hy¬ 
pothesis did you obtain from this theory and how did you obtain it? 
How did you test the hypothesis? What data did you use? To what extent 
did the test results confirm die hypothesis? 

2. Write a survey of the literature on your research topic. Organize your 
survey so that it provides a justification of the approach you will take in 
your research. 

3. Summarize the theoretical analysis of your research, where the objec¬ 
tive is to explain how the theory led to your hypothesis. 
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Rubric for Grading 
Research Papers 


1. Abstract of the Paper —Does the abstract adequately summarize the 
author’s argument? 

2. Quality of the Research Question —Does the introduction persuade 
the reader that the paper investigates a good research question? Is the 
question interesting and significant? 

3. Quality of the Literature Survey —Does tire literature survey identify 
the major studies to date on the topic? Does it only review studies that 
are relevant to the current paper’s research question? Does it summa¬ 
rize them succinctly and identify their weaknesses? Does it show how 
this l esearch relates to the literature, by improving in some way on 
what has been done previously? 

4. Solid analysis of the issue or problem —Does the paper display a clear 
understanding of how to apply economic theory to tire issue or prob¬ 
lem? Does the analysis lead to tire hypothesis of the paper? Is there a 
clear statement of the research hypothesis? 

5. Valid empirical testing approach —Is the empirical methodology a rea¬ 
sonable test of tire hypothesis? Is tire data adequate for the test? 

6. Reasonable interpretation of the results— Are tire results displayed ap¬ 
propriately? Are tire results correctly interpreted? Do drey support the 
hypothesis? Could they be interpreted in some way that doesn’t sup¬ 
port tire hypothesis. 

7. Overall Argument Based on tire evidence provided and interpreted 
by tire author, is the conclusion convincing? 

8. Tables and Graphs— Are they used effectively? 

9. Reference List —Is the reference list complete, and does it use tire cor¬ 
rect citation style? 

10. Data Appendix —Is tire data appendix complete, correct, and ade¬ 
quately cited? 
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abstract - Summaiy of tire argu¬ 
ment in a scholarly work. Usually 
abstr acts are limited to 
100-200 words. 

ad hoc model - A mathematical 
model that is not derived from 
optimizing principles. Instead, 
the hypothesized relationships 
come from common sense or 
experience. 

annotated bibliography - List of 
references on a topic, each of 
which is briefly described aird 
evaluated. An annotated bibliog¬ 
raphy also includes complete bib¬ 
liographic information on each 
reference. Compare with critical 
review and research abstract. 

annualized growth rate - What tire 
cumulative growth rate over a 
year would be of an actual growth 
rate over a shorter period of 
time. For example, what a 
monthly growth rate would be if 
it continued over twelve months. 

argument - Air assertion or claim 
supported by reasons or evi¬ 
dence. See also sound argument. 

auto, or serial, correlation - Com¬ 
mon problem of regression analy¬ 
sis. Autocorrelation means that 
tire errors in each observation of 


tire data sample are dependent 
on or correlated with each other. 
Autocorrelation artificially 
inflates tire reported t scores, so 
that explanatory variables may 
appeal' to be statistically different 
from zero when they are not. 

Boolean searching - Keyword 
searching using the Boolean 
operators AND, OR, or NOT to 
narrow or widen tire search. 

browsing - Searching for literature 
by manually examining documents 
or bibliographies for references. 
Contrast with keyword searching. 

causal validity - The degree to 
which a test indicates that a rela¬ 
tionship is causal, rather than 
merely correlational. For exam¬ 
ple, to what extent can one be 
sure that X causes Y, rather than 
the converse, or that both Zand 
Y are determined by some third 
variable? Causal validity is one 
dimension of internal validity. 

change, or first difference - The 
difference in values of two adja¬ 
cent observations. With time- 
series data, it is tire subsequent 
value less tire prior value. 

coefficient estimates - See esti¬ 
mated coefficients. 
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confounding, or control, 
variables - In a hypothesized rela¬ 
tionship, confounding variables 
are other variables that affect tire 
variable you are trying to explain. 
In order to correcdy assess the 
hypotiresized relationship, you 
need to control for tire effects of 
those other variables. For exam¬ 
ple, if you believe that consumer 
spending depends on income, to 
assess that relationship, you must 
control for the effects of other 
variables, such as interest rates or 
savings, which might also affect 
consumer spending. See also 
multiple regression analysis. 

correlation - A measure of dre 
degree of linear association 
between two variables. The corre¬ 
lation coefficient ranges from 
positive one (a perfect positive 
relationship between the two vari¬ 
ables) to zero (no relationship 
between the two variables) to 
negative one (a perfect negative 
relationship between the two vari¬ 
ables.) Compare witii covariance. 

covariance - A measure of how two 
variables vary togetiier. It is 
related mathematically to tire 
standard deviation and die vari¬ 
ance. Compare with correlation. 

critical review - The summary and 
evaluation of die argument of a 
research report, along with com¬ 
plete bibliographic information 
to identify the report. Compare 


with annotated bibliography and 
research abstract. 

critical thinking - The evaluation 
of competing arguments based 
on their logic and evidence. 

cross-section data - Different 
observations of a variable at the 
same point in time; for example, 
■ average personal income across 
the fifty states in 2004. Contrast 
with longitudinal data and time- 
series data. 

deductive reasoning - Reasoning 
that starts from one or more gen¬ 
eral principles and derives spe¬ 
cific predictions from them. See 
also valid deduction. Contrast with 
inductive reasoning. 

dependent variable - The concept 
we are trying to explain in a 
hypothesized relationship. Con¬ 
trast with explanatory, or indepen¬ 
dent, variable. 

derived data - Data actually used 
in empirical testing, which have 
been manipulated from the 
source data. Contrast with 
raw data. 

descriptive statistics - Statistics 
diat summarize the underlying 
data. Descriptive statistics include 
measures of central tendency, 
such as die mean, median, and 
mode, and measures of dispersion, 
such as the standard deviation and 
variance. 
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directories - Hierarchically orga¬ 
nized catalogs of information on 
die Internet on a variety of sub¬ 
jects. See also virtual libraries. 

dummy variable - A constructed 
vaiiable in a regression analysis 
to capture qualitative, usually 
binary factors such as gender 
(male/female). Dummy variables 
have values of one (e.g., male) or 
zero (e.g., not male). Dummy 
variables can be used in the con¬ 
struction of an intemction variable. 

economic, or scientific, signifi¬ 
cance - The quality of a relation¬ 
ship that is large enough to 
matter in the real world. See also 
size of an estimated coefficient. Con¬ 
trast with statistical significance. 

editing - Correcting the grammar, 
mechanics, and style of a piece of 
writing. 

endogenous variable - A variable 
that is determined inside the 
model being studied. Contrast 
with exogenous variable. 

estimated coefficients - The prod¬ 
ucts of regression analysis, the esti¬ 
mated coefficients are estimates 
of the relationship between each 
explanatory variable and tire depen¬ 
dent variable, ceteris paribus. See 
also sign of an estimated coefficient. 

explanatory, or independent, vari¬ 
able - A concept that is hypothe¬ 
sized to explain, affect, or cause 
the dependent variable. Indepen¬ 


dent variables can be exogenous or 
endogenous. Contrast with depen¬ 
dent variable; see also multiple 
regression analysis. 

exogenous variable - A variable 
that is determined outside the 
model being studied. As such, it 
is treated as a given in the model. 
Contrast with endogenous variable. 

experimental research - Research 
methodology that is usually char¬ 
acterized by two sample groups: 
an intervention or treatment 
group, and a control group. The 
control group is designed to be 
in every way identical to the treat¬ 
ment group except for the treat¬ 
ment. Thus, if the experiment 
between the two groups yields dif¬ 
ferent outcomes, then that differ¬ 
ence can be ascribed to the 
treatment. Contrast with survey, 
or nonexperimental, research. 

external validity - The degree to 
which the results of a study can 
be generalized to other situa¬ 
tions, applications, or circum¬ 
stances. Contrast with internal 
validity. 

focus - A feature of good writing, 
where the point of tire document 
is crystal clear to tire reader and 
tire argument does not wander 
from this point. 

good argument - See sound 
argument. 

growth rate - See rate of change. 
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heteroskedasticity - A common 
problem of regression analysis 
that occurs when tire errors in 
each observation of tire data sam¬ 
ple, while independent from one 
another, have different variances. 
Heteroskedasticity artificially 
inflates the reported t scores, so 
that explanatory variables may 
appear to be statistically different 
from zero when they are not. 

implicit price deflator - A price 
index constructed by dividing the 
real value of a concept into the 
nominal value, making it an 
implicit price index, or deflator. 
See also price index and unit price. 

independent variable - See 

explanatory variable. 

inference - A conclusion reached 
as the result of reasoning logi¬ 
cally about facts and relation¬ 
ships. See also warranted inference. 

index number - A unit-free 
weighted average designed to 
compare trends in prices or 
quantities since some base 
period. The raw data are trans¬ 
formed so that the value of die 
base period is 100. Two examples 
of index numbers include price 
indices and quantity indices. 

inductive reasoning - Reasoning 
that infers a general rule from 
one or more specific cases or situ¬ 
ations. Contrast with deductive 
reasoning. 


instrument validity - The degree 
to which a test instrument mea¬ 
sures what it purports to. Instru¬ 
ment validity is one dimension of 
internal validity. 

interaction term - A constructed 
variable in a regression analysis 
designed to capture the effects of 
two explanatory variables, for 
example, age and race, interact¬ 
ing. The interaction variable is 
constructed by multiplying die 
two explanatory variables 
together. See also squared term 
and dummy variable. 

internal validity - The degr ee to 
which an impact observed ( Y) can 
be attributed to die study variable 
(X); in other words, given die 
assumptions and evidence, one 
can deduce that X causes Y. See 
also instrument validity, relationship 
validity, and causal validity. Con¬ 
trast with external validity. 

keyword searching - Searching for 
literature using search engines 
on die Internet or on specialized 
electronic databases. Compare 
with phrase searching. Contrast 
with browsing. 

knowledge - The meaning or 
interpretation of evidence. 

level of significance - The risk that 
a researcher is willing to take that 
the null hypothesis will be rejected 
when it is true. Commonly 
accepted levels of significance are 
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1 or 5 percent. A 10 percent level 
of significance is about die 
highest most researchers will 
accept. See also Type 1 error and 
Type 2 error. 

literature (on a subject) - Studies 
to date on a subject, bodi pub¬ 
lished and in working papers. 

literature survey or review - The 
process of identifying the major 
studies to date on a subject, or a 
written summaiy of those studies. 

logical fallacy - An argument that 
is flawed because the conclusion 
does not actually follow from the 
reasons stated, even though die 
argument is phrased in a way that 
makes you think it does. 

longitudinal, or panel, data - A 
sample of data that is essentially a 
cross section followed over time, 
where stricdy speaking die cross 
section contains the same individ¬ 
uals in each time period. Other¬ 
wise, die sample is described as a 
pooled cross section. Contrast 
with cross-section data and time- 
series data. 

maintained hypothesis - The theo¬ 
retical prediction of a model; 
what one hopes to prove when 
conducting a statistical test. If the 
evidence rejects the null hypothe¬ 
sis, dien it confirms the main¬ 
tained hypotiiesis. 

mean - A measure of central ten¬ 
dency, computed as the arith¬ 


metic average. Contrast with 
median and mode. See also descrip¬ 
tive statistics. 

median - A measure of central ten¬ 
dency defined as die middle 
value of a data sample with half 
the observations above die 
median and half die observations 
below. Contrast with mean and 
mode. See also descriptive statistics. 

mode - A measure of central ten¬ 
dency defined as the most fre¬ 
quent value. Contrast with mean 
and median. See also descriptive 
statistics. 

moving average - Data-smootiiing 
technique that replaces the 
actual observation in each period 
with an average of the n -1 pre¬ 
ceding data points. 

mathematical reasoning - An 
approach to theory development 
based on madiematical manipu¬ 
lation of premises to derive for¬ 
mal hypotheses. 

micro data set - Observations of 
individual economic agents such 
as individuals, households, or 
firms. 

multicollinearity - A common 
problem of regression analysis 
that occurs when two or more 
explanatory variables are highly 
con-elated. As a result, die 
reported t statistics are artificially 
low, making it more difficult to 
prove statistical significance. 
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multiple regression analysis - A sta¬ 
tistical method of estimating 
hypothesized relationships while 
controlling for the effects of con¬ 
founding variables. Regression is a 
technique for estimating the 
independent influences of each 
explanatory variable, and thus 
statistically maintaining ceteris 
paribus. 

narrative reasoning - An approach 
to theory development based on 
brainstorming about tire topic. 

nominal data - Values expressed in 
terms of actual market prices. 
Nominal is also known as current 
dollars. Contrast with real data. 

nonscience - Fields of study in 
which arguments are assessed 
through logic and normative 
standards rather than empirical 
evidence. Contrast with science. 

null, or statistical, hypothesis - 
The hypothesis that is actually 
examined in a statistical test— 
for example, that there is no 
relationship between two vari¬ 
ables. Contrast with maintained 
hypothesis. 

optimizing model - A mathemati¬ 
cal model derived explicitly by 
assuming that economic agents 
maximize or minimize an objec¬ 
tive function such as profits, util¬ 
ity, or costs. 

organization - A feature of good 
writing, where a series of nested 


arguments lead logically to the 
diesis as a conclusion. 

parameter estimates - See esti¬ 
mated coefficients. 

per capita - Per person. Some 
quantity (e.g., GDP) divided by 
the population gives the quantity 
per capita (e.g., GDP per capita). 

percentage - The proportion, or 
share, expressed as a fraction of 
one hundred. For example, one- 
half is 50/100, or 50 percent. 

percent change - A change 
expressed as a percentage. See 
also rate of change. 

phrase searching - Keyword 
searching on an exact phrase. 

plagiarism - Taking credit for 
someone else’s words or ideas, 
even when it’s unintentional. 

popular literature - Summaries of 
scholarly studies, often written by 
journalists. Contrast with scholarly 
literature. 

power of a statistical test - The 
probability of correctly rejecting 
a null hypothesis when it is 
not true. 

prewriting - Developing a written 
argument by exploring ideas and 
relationships. Also known as 
exploratory writing. 

price index - The index number that 
is essentially a weighted average of 
tire prices of some type of good or 
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service, where the weights tend to 
be either the corresponding quan¬ 
tities or die budget shares in a 
given year. See also implicit price 
deflator. Contrast with quantity index. 

primary data source - The person 
or organization that collected or 
constructed a data set, often 


range - A measure of dispersion of 
a data sample indicated by the 
lowest value in die sample and 
die highest value. See also stan¬ 
dard deviation, variance, and 
descriptive statistics. 

raw data - The data as they exist 
at die source. Contrast with 


using surveys. See also secondary 
data source. 

proportion - The share or fraction 
of a whole. See also percentage. 

quantity index - The index number 
that is essentially a weighted aver¬ 
age of the quantities of some type 
of good or sendee, where the 
weights are the corresponding 
prices in a given year. Contrast 
with price index. 

random sample of data - A sample 
for which every item in the popu¬ 
lation has an equal chance of 
being selected. An equivalent 
definition is that every possible 
sample of tire same size n has an 
equal chance of being the sample 
selected. A nonrandom sample of 
data doesn’t correctly reflect the 
underlying population of data; 
for example, there could be 
more above- or below-average 
observations than occur propor¬ 
tionately in tire population. See 
also sampling error 

random variation - The variation 
in some data that is not systemati¬ 
cally explainable. 


derived data. 

rate of change - A change 
expressed as a percentage; for 
example, the change divided by 
the initial value. See also percent 
change. 

real data - Values expressed in 
terms of prices as they existed in 
a base period. Real data exclude 
the effects of inflation since the 
base period. Real dollars are also 
known as constant dollars, or 
1992 dollars, where 1992 is the 
chosen base year. Contrast with 
nominal data. 

relationship validity - The extent 
to which one can conclude on 
the basis of an empirical test that 
there is a statistical relationship. 
Relationship validity is one 
dimension of internal validity. 

research - The creation of 
knowledge. 

research abstract - A summary of 
the argument made by a research 
report, along with complete bibli¬ 
ographic information to identify 
the report. See also annotated bib¬ 
liography and critical review. 
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research hypothesis - The pro¬ 
posed answer to a research ques¬ 
tion. The hypothesis is the result 
deduced from a researcher’s 
analysis. It is what is tested in an 
empirical research project. 

research question - The specific 
focus of a research project. 

research topic - The broad 
subject area a research project 
will investigate. 

revising - Reviewing the informa¬ 
tion in an argument, rethinking 
the way it is organized, and 
looking for new patterns of 
meaning; more than just 
editorial correction. 

sampling error - The result of a 
nonrandom sample of data 
where the above-average values of 
some observations are not offset 
by tire below-average values of 
other observations. As a conse¬ 
quence, the data appear to show 
a relationship that does not in 
fact exist. See also random sample. 

scholarly literature - Studies done 
by experts in a field and pub¬ 
lished in professional journals or 
books. Contrast with popular 
literature. 

science - Fields of study in which 
arguments are empirically 
testable. Contrast with nonscience. 

scientific method - The set of pro¬ 
cedures for drawing valid, reli¬ 


able, and objective conclusions 
about scientific hypotheses. 

seasonal adjustment - Data- 
smoothing technique by which 
normal seasonal fluctuations in 
the data have been removed. The 
results show any divergence from 
tire normal seasonal pattern. 

secondary data source - An indi¬ 
vidual or organization that pub¬ 
lishes data that it did not 
produce. Often tire data are eas¬ 
ier to use than the primary data. 
Contrast with primary data source. 

significance (of a research topic or 
question) - The scientific impor¬ 
tance or weight of a research 
topic or question. 

significance level - See level of sig¬ 
nificance. 

sign of an estimated coefficient - 
Important for testing hypotheses 
about relationships between vari¬ 
ables, the (algebraic) sign shows 
whether a relationship is positive 
or negative. See also estimated 
coefficient and size of an estimated 
coefficient. 

simultaneous equations bias - A 
common problem of regression 
analysis. In an ordinary least 
squares regression, all tire 
explanatory variables are assumed 
to be exogenous to the model. In 
other words, the explanatory vari¬ 
ables are hypothesized to affect 
the dependent variable, but the 



converse is not tine: tire depen¬ 
dent variable should have no 
effect on the independent vari¬ 
able. If causation runs both ways, 
die result is simultaneous equa¬ 
tions bias and the coefficient esti¬ 
mates will be biased. 

size of an estimated coefficient - 
The magnitude of a hypothesized 
relationship; tire size is important 
for assessing the economic signifi¬ 
cance of a relationship. See also 
estimated coefficient and sign of an 
estimated coefficient. 

Social Science Citation Index - A 
bibliographic search tool that 
allows backward and forward 
searches chronologically. That is, 
it identifies all the studies cited 
by a given study as well as all tire 
studies that cite it. 

sound argument - A valid deduc¬ 
tion in which all tire premises are 
true. See also argument. 

source data - See raw data. 

specification error - A common 
problem of regression analysis, 
where the empirical model is 
incorrect, for example, because 
of omitting relevant variables or 
including irrelevant ones. In tire 
former case, the estimated coeffi¬ 
cients will be biased and the t sta¬ 
tistics will be artificially reduced; 
in the latter case, tire coefficients 
are unbiased but the t statistics 
are inflated. 


squared term - A constructed vari¬ 
able in a regression analysis 
used to turn a quadratic rela¬ 
tionship that illustrates increas¬ 
ing or diminishing returns into 
a linear relationship. The vari¬ 
able is constructed by squaring 
the base factor, for example, 
Age_squared = age * age. See 
also interaction term. 

standard deviation - A measure of 
dispersion of a data sample 
defined approximately as the 
average amount by which a data 
point differs from the mean value 
of the sample. See also variance 
and descriptive statistics. 

standard error - A measure of dis¬ 
persion in a data sample, normal¬ 
ized by the size of the sample. 

The standard error is defined 
as the ratio of the standard 
deviation to the square root of 
the sample size. 

statistical hypothesis - See null 
hypothesis. 

statistical hypothesis testing - 
Using methods of statistical infer¬ 
ence to test hypotheses about 
relationships between variables. 
Typically, statistical methods are 
used to differentiate between 
alternatives called the null hypoth¬ 
esis and the maintained hypothesis. 
Commonly used tests include tire 
f-test and the T-test. See also statis¬ 
tical significance. 
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Glossary- 


Statistical significance - The qual¬ 
ity of a statistical relationship that 
is strong enough to reject the 
conclusion that it is spurious, due 
entirely to random chance in the 
data. See also statistical hypothesis 
testing. Contrast with economic 
significance. 

survey, or nonexperimental, 
research - Survey research 
involves the passive observation 
and analysis of events as they 
occur in nature, while using sta¬ 
tistical methods to account for 
outside factors. Contrast with 
experimental research. 

tune-series data - Different obser¬ 
vations on the same variable at 
different points across time; for 
example, U.S. personal income 
for the period 1954-2004. Con¬ 
trast with cross-section data and lon¬ 
gitudinal data. 

time trend - A constructed variable 
in regression analysis used to cap¬ 
ture the effects of technological 
improvement or other trends 
over time. 

Type 1 error -Wrongly rejecting 
the null hypothesis. Contrast with 
Type 2 error. 

Type 2 error - Wrongly accepting 
the null hypothesis. Contrast with 
Type 1 error. 


unit value - An estimate of the 
price of something defined as the 
value of shipments divided by tire 
volume of shipments, or the 
nominal value divided by the real 
value. Compare with implicit price 
deflator. 

valid deduction - A deduction in 
which the conclusion must follow 
from the premises. 

validity - See internal or external 
validity. 

variance — A measure of dispersion 
of a data sample defined as the 
square of the standard deviation. 
See also descriptive statistics. 

virtual libraries - Internet directo¬ 
ries compiled by librarians or 
other experts. As such, they tend 
to have better sources of informa¬ 
tion than directories. 

warranted inference - A conclu¬ 
sion that follows from the evi¬ 
dence. See also inference. 

warrants — Unstated or underlying 
assumptions on which an argu¬ 
ment stands. 

writing to learn — The idea that 
writing is more than a product; it 
is also a process or a tool for 
working through and making 
sense out of ideas you don’t fully 
understand. 
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